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Abstract 

Multivariate interaction between two or more classes (or species) has important consequences in many 
fields and causes multivariate clustering patterns such as segregation or association. The spatial segregation 
occurs when members of a class tend to be found near members of the same class (i.e., near conspecifics) 
while spatial association occurs when members of a class tend to be found near members of the other class 
or classes. These patterns can be studied using a nearest neighbor contingency table (NNCT). The null 
hypothesis is randomness in the nearest neighbor (NN) structure, which may result from — among other 
patterns — random labeling (RL) or complete spatial randomness (CSR) of points from two or more classes 
(which is called the CSR independence, henceforth). In this article, we introduce new versions of overall 
and cell-specific tests based on NNCTs (i.e., NNCT-tests) and compare them with Dixon's overall and 
cell-specific tests. These NNCT-tests provide information on the spatial interaction between the classes at 
small scales (i.e., around the average NN distances between the points). Overall tests are used to detect 
any deviation from the null case, while the cell-specific tests are post hoc pairwise spatial interaction tests 
that are applied when the overall test yields a significant result. We analyze the distributional properties 
of these tests; assess the finite sample performance of the tests by an extensive Monte Carlo simulation 
study. Furthermore, we show that the new NNCT-tests have better performance in terms of Type I error 
and power. We also illustrate these NNCT-tests on two real life data sets. 

Keywords: Association; clustering; completely mapped data; complete spatial randomness; random labeling; spatial 
pattern 
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1 Introduction 



Multivariate clustering patterns such as segregation or association, result from multivariate interaction be- 
tween two or more classe s (or species) . S uch patte r ns are of int erest in ecological sc iences and other application 
areas. See, for example, Pielou ( 1961 ). Whipple! ( 1980l ). and Dixon ( 1994 . 2002[ ). For convenience and gen- 
erality, we refer to the different types of points as "classes" , but the class can stand for any characteristic of 
an observa tion at a part icular location. For ex ample, the spatial segrega tion pattern h as been investigated 
for sp ecies (Diggle ( 20031 )). age classes of plants jHaniill and Wright ( 19861 )). fish species ( Herler and Patzner 
(l2005h ). and sexes of dioecious plants (jNanami et al.l (119991)). Many o f the e pidemiological applications are 
for a two-class system of case and control labels ( Waller and Gotwav ( 20041 )). These methods can also be 
applied to social and ethnic segregation of residential areas. For simplicity, we discuss the spatial interaction 
between two and three classes only; the extension to the case with more classes is straightforward. The 
null pattern is usually one of the two (random) pattern types: random labeling (RL) or complete spatial 
randomness (CSR). We consider two major types of spatial clustering patterns as alternatives: segregation 
and association. Segregation (association) occurs when objects of a given class have NNs that are more (less) 
frequently of the same (other) class than would be expected if there were randomness in the NN structure. 

In stat istical and othe r literature, many univariate and multivariate spatial clustering tests have be en 
proposed (jKuUdorfl llooi)). These include comp arison of Ripley's K(t ) and L (t) func t ions (iRiplevI (|2004[ )). 
comparison of nearest neig hbor (NN) distances (|Cuzick and Edwards! (|l990!) . iDiggld (|2003f )). and analysis 
of nearest nei g hbor contingency table s (NN C Ts) which are constructed using the NN f requencies of classes 
( Pielou! (|l96l! ). lMeagher and Burdick! (|l980! )). IPieloul (|l96ll ) proposed various tests and IPixonl (jl994l ) intro- 
duced an overall test of segregation, cell-speci fic and c lass-s pecific tests based on NNCTs in a two-class setting 
and extended his tests to multi-class case in ( Dixon (I2OO2I) ). 



In this article, we introduce new overall and cell-specific tests of segregation based on NNCTs for testing 
spatial clustering patterns in a multi-class setting. W e compare th ese tests with Dixon's NNCT-tests which 
are introduced for testing against the RL of points ( Dixon ( 1994[ )). We extend the use of these tests for 
the CSR independence patter n also. We also compare the NNCT-tests with Ripley's K or L-functions and 
pair correlation function g{t) (jStovan and Stov"aii ( 1994 )). which are methods for second-order analysis of the 
point pattern. We only consider completely mapped data; i.e., for our data sets, the locations of all events in 
a defined area are observed. We show through simulation that Dixon's cell-specific test can have undesirable 
properties in some situations. The newly proposed cell-specific tests perform better (in terms of empirical size 
and power) than Dixon's cell-specific tests. Likewise the new overall test tends to have higher power compared 
to Dixon's overall test under segregation of the classes. Furthermore, we demonstrate that NNCT-tests and 
Ripley's L-function (and related methods) answer different questions about the pattern of interest. 

We provide the null and alternative patterns in Section [2l describe the NNCTs in Section [3l provide the 
cell-specific tests in Section |4l overall tests in Section [5l empirical significance levels in the two- and three- 
class cases in Sections 16.11 and 17.11 respectively, rejection rates of the tests under various Poisson processes 
in Section [51 empirical power comparisons under the segregation and association alternatives in the two-class 
case in Section [3J in the three-class case in Section 1101 examples in Section 1111 and our conclusions and 
guidehnes for using the tests in Section [T^ 



2 Null and Alternative Patterns 

In this section, for simplicity, we describe the spatial point patterns for two classes only; the extension to 
multi-class case is straightforward. 

In t he univariate s patial point pattern analysis, the null hypothesis is usually complete spatial randomness 
(CSR) (|Digglel (|2003l )). Given a spatial point pattern V = {Xi{D),i = 1, . . . , 71 : £) C M^} where Xi{D) is 



the Bernoulli random variable denoting the event that point i is in region D. The pattern V exhibits CSR if 
given n events in domain D, the events are an independent random sample from a uniform distribution on 
D. This implies there is no spatial interaction, i.e., the locations of these points have no influence on one 
another. Furthermore, when the reference region D is large, the number of points in any planar region with 
area A{D) follows (approximately) a Poisson distribution with intensity A and mean A • A(D). 



To investigate the spatial interaction between two or more classes in a multivariate process, usually there 
are two benchmark hypotheses: (i) independence, which implies two classes of points are generated by a pair 
of independent univariate processes and (ii) random labeling (RL), which implies that the class labels are 
randomly assigned to a given set of locations in the region of interest (IDiggle In this article, we will 



consider two random pattern types as our null hypotheses: CSR of points from two classes (this pattern will 
be called the CSR independence, henceforth) or RL. In the CSR independence pattern, points from each of 
the two classes independently satisfy the CSR pattern in the region of interest. On the other hand, random 
labeling (RL) is the pattern in which, given a fixed set of points in a region, class labels are assigned to these 
fixed points randomly so that the labels are independent of the locations. So RL is less restrictive than CSR 
independence. CSR independence is a process defining the spatial distribution of event locations, while RL 
is a process, conditioned on locations, defining the distribution of labels on these locations. 

Our null hypothesis is 

Ho : randomness in the NN structure. 

Although RL and CSR independence are not same, they lead to the same null model in tests using NNCT, 
which does not require spatially-explicit information. That is, when the points from two classes are assumed 
to be independently uniformly distributed over the region of interest, i.e., under the CSR independence 
pattern, or when only the labeling (or marking) of a set of fixed points, where the allocation of the points 
might be regular, aggregated, or clustered, or of lattice type, is considered, i.e., under the RL pattern, there 
is randomness in the NN structure. The distinction between RL and CSR independence is very important 
when de fining the appropriate null mo del in practice, i.e., the null model depends on the particular ecological 



context. iGoreaud and Pelissien (|2003l ) state that CSR independence implies that the two classes are a priori 
the result of different processes (e.g., individuals of different species or age cohorts), whereas RL implies 
that some processes affect a posteriori the individuals of a single population (e.g., diseased vs. non-diseased 
individuals of a single species). We provide the differences in the proposed tests under these two patterns. 

We tre at CSR indep endence or RL as the main null model of interest, since this is the logical point of 
departure ( Digglel ( 20031 )). However, in the ecological and epidemiological settings, CSR independence is the 



exception rather than rule. Furthermore, it is conceivable for other models to imply randomness in the NN 
structure also. We also consider patterns that deviate from stationa rity or homog eneity in the point process. 
In particular, we c onsider various types o f Poiss on cluster processes (iDiggld (I2OO3I) ) and other inhomogencous 



Poisson processes (iBaddelev and Turned (|2005h ). Randomness in the NN structure will hold if both classes 



independently follow the same process with points having the same support. For example, in a Poisson cluster 
process, NN structure will be random if parents are the same for each class. If classes have different parent 
sets, then the Poisson cluster process will imply segregation of the classes. If parent and offspring sets are 
treated as two different classes, then Poisson cluster process will imply association of the two classes. Further, 
if the two classes are from the same inhomogencous Poisson pattern, again randomness in the NN structure 
will follow. But when the two classes follow different inhomogencous Poisson patterns whose point intensities 
differ in space, it might imply the segregation or association of the classes. 

As clustering alternatives, we consider two major types of spatial patterns: segregation and association. 
Segregation occurs if the NN of an individual is more likely to be of the same class as the individual t han to 



be fro m a different class; i.e., the members of the same class tend to be clumped or clustered (see, e.g.. lPielou 



( 196l[ )). For instance, one type of plant might not grow well around another type of plant and vice versa. In 



plant biology, one class of points might represent the coordinates of trees from a species with large canopy, 
so that other plants (whose coordinates ar e the o t her cl a ss of points) that nee d light cannot grow (well or at 
all) around these trees. See, for instance, ( Dixon ( 1994 ): [Coomes et al. ( 1999f )). 



Association occurs if the NN of an individual is more likely to be from another class than to be of the 
same class as the individual. For example, in plant biology, the two classes of points might represent the 
coordinates of mutualistic plant species, so the species depend on each other to survive. As another example, 
one class of points might be the geometric coordinates of parasitic plants exploiting the other plant whose 
coordinates are of the other class. In epidemiology, one class of points might be the geographical coordinates 
of residences of cases and the other class of points might be the coordinates of the residences of controls. 

Each of the two patterns of segregation and association are not symmetric in the sense that, when two 
classes are segregated (or associated), they do not necessarily exhibit the same degree of segregation (or 
association). For example, when points from each of two classes labeled as X and Y are clustered at different 
locations, but class X is loosely clustered (i.e., its point intensity in the clusters is smaller) compared to class 



Y. So classes X and Y are segregated but class Y is more segregated than class X. Similarly, when class Y 
points are clustered around class X points but not vice versa, classes Y and X are associated, but class Y is 
more associated with class X compared to the other way around. Many different forms of segregation (and 
association) are possible. Although it is not possible to list all seg regation types, its existence can be tested 



igregat 

by an analysis of the NN relationships between the classes (jPielorJ (|l961l )) 



3 Nearest Neighbor Contingency Tables 

NNCTs are constructed using the NN frequencies of classes. We describe the construction of NNCTs for 
two classes; extension to multi-class case is straightforward. Consider two classes with labels {1,2}. Let Ni 
be the number of points from class i for i £ {1, 2} and n be the total sample size, so n = iVi + N2. If we 
record the class of each point and the class of its NN, the NN relationships fall into four distinct categories: 
(1,1), (1,2); (2,1), (2,2) where in cell (i,j), class i is the base class, while class j is the class of its NN. 
That is, the n points constitute n (base, NN) pairs. Then each pair can be categorized with respect to the 
base label (row categories) and NN label (column categories). Denoting Nij as the frequency of cell 
for i,j S {1,2}, we obtain the NNCT in Table [1] where Cj is the sum of column j; i.e., number of times 
class j points serve as NNs for j € {1,2}. Furthermore, Nij is the cell count for cell (i,j) that is the count 
of all (base, NN) pairs each of which has label («, j). Note also that n = J2i ^ijl — 



I Nij-, and 



Cj = J2i=i ^ij- construction, if Nij is larger than expected, then class j serves as NN more frequently to 
class i than expected, which implies segregation if i = j and association of class j with class i \ii ^ j. On the 
other hand, if Nij is smaller than expected, then class j serves as NN less frequently to class i than expected, 
which implies lack of segregation if i = j and lack of association of class j with class i \i i ^ j. Furthermore, 
we adopt the convention that variables denoted by upper case letters are random quantities, while variables 
denoted by lower case letters fixed quantities. Hence, column sums and cell counts arc random, while row 
sums and the overall sum are fixed quantities in a NNCT. 
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Table 1: The NNCT for two classes. 



Pieloul (|l96l[ ) used Pearson's test of independence for testing segrega tion. Due to the ease i n com - 
putation and interpretation, Pielou's test of segregation is used frequently ( Meagher and Burdick (1983)) 
for both completely mapped and sparsely sampled data. For ex ample, Pielou' s test is used for the segre- 
gation between males and females in dioecious species (see, e.g. , Herreral 1 1988| ) and Armstrong and Irvine! 
( 1989h ) and between differen t species (iGood and ^Vhipple! 
for completely mapped data (jMeagher and Burdick! (|l980r ). 



1 9821) ) . How ever Pielou's test is not appropriate 
DixonI ( I994I )). since the test of independence 



requires independence between cell-counts (and rows and columns also), which is violated under RL or CSR 
independence. In fact, this independence between cell-counts is violated for spatial data in general and in 
particular it i s violated under the null pat terns, so Pielou's test is not of the desired size. This problem was 
first noted bv !Meagher and Burdick! (|l980! ) who identify the main source of it to be reflexivity of (base, NN) 
pairs. A (base, NN) pair {X,Y) is reflexive if (Y, X) is a l so a (b ase, NN) pair. As an alternative, they suggest 



using Monte Carlo simulations for Pielou's test. iDixonI (|1994) derived the appropriate asymptotic sampling 



distribution of cell counts using Moran join count statistics 



Moran! ( 1948 )) and hence the appropriate test 



which also has a x^-distribution asymptotically. !Dixon ( 199'! ) also states that although Pielou's test is not 
appropriate for completely mapped data, it may be appropriate for sparsely sampled data. 



4 Cell-Specific Tests of Segregation 



In this section, we describe Dixon's cell-specific test of segregation and introduce a new type of cell-specific 
test based on NNCTs. 



4.1 Dixon's Cell-Specific Tests of Segregation 



The level of segregation can be estimated by comparing the observed cell counts to the expected cell counts 
under RL of points whose locations are fixed or a realization of points from CSR independence. Dixon 



demonstrates that under RL, one can write down the cell frequencies as Moran join count statistics (IMoran 



1948 )). H e then derives the means, variances, and covariances of the cell counts (i.e., frequencies) (see. lDixon 



1994 andlDijconl (|2002f )) 



Under RL, the expected cell count for cell is 



ni(?7.j - l)/(?7. - 1) if i = j. 



nj?ij7(n- 1) 



(1) 



where Ui is a realization of Ni, i.e., is the fixed sample size for class i for i ~ 1, 2, . . . , (7. Observe that the 
expected cell counts depend only on the size of each class (i.e., row sums), but not on column sums. 



The test statistic suggested by Dixon is given by 



r,D _ 



(2) 



where 



, \ {n + R)pii + (2n -2 R + Q)ptii + {ri^ - 'in - Q + R)piiii - {npii)'^ ifi=j, 
[^npij + Qpiij + (n — Sn ~ Q + Rjpiijj — [npij) itif^j, 

with pxx, Pxxx, and Pxxxx are the probabilities that a randomly picked pair, triplet, or quartet of points, 
respectively, are the indicated classes and are given by 



Pi. 



rii (rtj 
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(4) 



1) 



n(n-l)(n-2) (71 - 3) 



Furthermore, R is twice the number of reflexive pairs and Q is the number of points with shared NNs, which 
occurs when two or more points share a NN. Then Q = 2 {Q2 + 3Q3 + 6Q4 + IOQ5 + 15 Qe) where Qfc 
is the number of points that serve as a NN to other points k times. Furthermore, under RL Q and R are 
fixed quantities, as they depend only on the location of the points, not the types of NNs. So the sampling 
distribution is appropriate under RL (see also Remark l5.2p and Zj^ asymptotically has iV(0, 1) distribution. 
But unfortunately, for g > 2 the asymptotic normality of the off-diagonal cells in NNCTs is not rigorously 
established ye t, although extensive Monte Carlo simulations indicate approximate normality for large samples 
( DixonI (|2002l) ). One-sided and two-sided te sts are possib le for each cell (i, j) using the asymptotic normal 
approximation of Zj^ given in Equation ([2]) ( Dixon ( 1994 )). 



Under CSR independence, the quantities Q and R are random, hence the sampling distributions of the cell 
counts arc conditional on these quantities. Hence the expected cell counts in ([1]) and the cell-specific test in 
^ and the relevant discussion are similar to the RL case. The only difference is that under RL, the quantities 
Q and R arc fixed, while under CSR independence they are random. That is, under CSR independence, Z^' 
asymptotically has N{0, 1) distribution conditional on Q and R. 



4.2 A New Cell-Specific Test of Segregation 

In standard cases like multinomial sampling with fixed row totals and conditioning on the column totals, the 
expected cell count for cell in contingency tables is E[A'ij] = "~^r^- We first consider the difference 

Ay := Nij — ^^^^ for cell Notice that under RL, Ni = Ui are fixed, but Cj are random quantities and 

= Z^Li hence 



Then under RL 

r ni(ni-l] _ 

EfA.-,.l = <! J"ri) 



E[A,,] = nr^„ /; (5) 



For all i, E[Cj] = rij, since 



iijirij — 1) 



(n-1) 



Therefore, 



{ni^m — l) -r ■ ■ 

rii rij Ui rij ■ / ■ V / 

Notice that the expected value of is not zero under RL. Hence, instead of A^j, in order to obtain 
expected value for our test statistic, we suggest the following: 

Then E[Tjj] = 0, since, for i = 

(ni — 1) , nArij — 1) (rij — 1) 
E T,, - E TV,, - L^^E a = 7 - 7^^", = 0, 

(ri — 1) (n — 1) (n — 1) 

and for i ^ j , 

E[T,,] _ E[iV,,] - -^^nC,] ~ - 7^7^"^' - 

For the variance of , we have 

1 _ J^^^t^'^] + ^Var[Q] - 2^Cov[7V,„ Q] if z = 
[ Var [TV,,] + ^Var [Q] - 2 (^^T) Gov [TV,, , C,] if z ^ j , 

where Var[7Vjj] are as in Equation (O, Var[Cj] = X^Li Var[TVjj]+X]fc5^, EiCovjTVj^TVfcj] and Cov[iV,y, = 
ELi Cov[iV„,7Vfcj] with Caw[N,j,Nki] are as in Equations (4)-(12) of lDixonI l|2002f) . 

As a new cell-specific test, we propose 



Re call that in the two-class case, each cell count Nij has asymptotic normal distribution ( Cuzick and Edwards! 
ience, Zf^ also converges in law to -/V(0, 1) as n ^ oo. Moreover, one and two-sided versions of this 
test are also possible. 



Under CSR independence, the distribution of the test statistics above is similar to the RL case. The only 

■N 



difference is that Z/^ asymptotically has N{Q, 1) distribution conditional on Q and R 



5 Overall Tests of Segregation 



In this section, we describe Dixon's overall test of segregation and introduce a new overall test based on 
NNCTs. 



5.1 Dixon's Overall Test of Segregation 



In the multi-class case with q classes, combining the cell-specific tests in Section BTTl iDixon ( 2002 ) suggests 
the quadratic form to obtain the overall segregation test as follows. 



Cd = (N-E[N])'S^(N-E[N]) 



(10) 



where N is the q^ x 1 vector of q rows of NNCT concatenated row-wise, E[N] is the vector of E[A'ij] which 
are as in Equation ([T]), is the q^ x q^ variance-covariance matrix for the cell count vector N with diagonal 
entries equal to Var[A^M] and off-diagonal entries bein g CovlNg, N ki] for ^ {k,l)- The explicit forms 



of t he variance an d covariance terms are provided in (jPixon (|2002h ). Also, Yljj is a generalized inverse of 



(ISearld tOO^ ) and ' stands for the transpose of a vector or matrix. Then under RL Cd has 



distribution asymptotically. Furthermore, the test statistics Zjj are dependent, hence their squares do not 
sum to Cjv- Under CSR independence, the distribution of C13 is conditional on Q and R. 



5.2 A New Overall Test of Segregation 



Instead of combining the cell-specific tests in Section 14.11 we can also combine the new cell-specific tests in 



Section l4?2l Let T be the vector of q^ Tij values, i.e. 



T — [Til, ri2, . . . , Tlq, T21, T22, . . . , T2q, . . . , Tqq]' , 

and let E[T] be the vector of Tij values. Note that E[T] = 0. Hence to obtain a new overall segregation test, 
we use the following quadratic form: 

Cn - T'S^T (11) 

where Sat is the q^ x variance-covariance matrix of T. Under RL Cn has a xfq-iy distribution asymp- 
totically, since rank of Syv is {q— 1)^. Furthermore, the test statistics Zfj arc dependent, hence their squares 
do not sum to Cat. 

Under RL, the diagonal entries in the variance-covariance matrix Sjv are Var[rij] which are provided in 
Equation ([H]). For the off-diagonal entries in EjVj i-e., Cov[ry , Tf^i] with i ^ k and j ^ I, there are four cases 
to consider: 

case 1: i ^ j and k ~ I, then 



Nii — ^ — -T-Ci,Nkk — -7— — TV^'k 
(n — 1) (n — 1) 



(n — 1) [n — 1) (n — 1)- 



case 2: i = j and k I, then 



Co^^[Tu,nl] = Gov 



Co^^[Nu,Nkl] 



[n, - 1) 
(n-1) 

rik 



{n-1) 



C, 



(n-1) 



Cov[iV,„ Q] l!^l_ilcov[iV,.,, a] + l^i-i)jiCov[Q, G]. (13) 



case 3: i j and k = I, then Co\-[Tij, Tkk] = Cov[Tfcfc, Tij], which is essentially case 2 above, 
case 4: i j and k ^ I, then 



Cov[ry,r,,] = Cov 



/v. . : 

(n-1) 



nk 



(n-1) 



Ci 



Cow[N,j,Nki\ - -^^Cov[iVy , G] 
[n i j 



Cov[iVfc,,Q] + /^Cov[Q,G]. (14) 



(n-1) 



{n-iy 



In all the above cases, Cov[iVy, A^fcj] are as in Dixon ( 20021 ). Cov[iVy, C;] = X]fc=i ^o'^I^y ' ^fe'] ^^'^ 
Cov[C, , = ELi ELi Cov[7V,,; , Ni,]. 

Under CSR independence, the distribution of Cat is as in the RL case, except that it is conditional on Q 
and R. 

Remark 5.1. Comparison of Dixon's and New NNCT- Tests: Dixon's cell-specific test in ([2]) depends on 
the frequencies of (base, NN) pairs (i.e., cell counts), and measures deviations from expected cell counts. On 
the other hand, the new cell-specific test in ^ can be seen as a difference of two statistics and has expected 
value is for each cell. For the cell-specific tests, the z-score for cell (i, j) indicates the level and direction of 
the interaction of spatial patterns of base class i and NN class j. If Zj^ > then class i exhibits segregation 
from other classes, and if Zf^ < then class i exhibits lack of segregation from other classes. The same holds 
for the new cell-specific tests. Furthermore, cell-specific test for cell measures the interaction of class j 
with class i. When i = j this interaction is the segregation for class i, but if i 7^ j, it is the association of 
class j with class i. Hence for i j cell-specific test for cell is not symmetric, as interaction of class 

j with class i could be different from the interaction of class i with class j. However, new cell-specific tests 
use more of the information in the NNCT compared to Dixon's tests, hence they potentially will have better 
performance in terms of size and power. 

Dixon's overall test combines Dixon's ccU-spccific tests in one compound summary statistic, while new 
overall test combines the new cell-specific tests. Hence the new overall test might have better performance in 
terms of size and power, as it depends on the new cell-specific tests. □ 

Remark 5.2. The Status of Q and R under RL and CSR Independence: Under RL, Q and R are 

fixed quantities, but under CSR independence they arc random. The variances and covariances Var[A^ij] and 
Cov[Nij,Nki] and all the quantities depending on these quantities also depend on Q and R. Hence under 
CSR independence, they arc variances and covariances conditional on Q and R. The unconditional variances 
and covariances can be obtained by replacing Q and R by their expectations. 

Unfortunately, given the difficulty of calculating the expectations of Q and R under CSR independence, 
it is reasonable and convenient to use test statistics employing the conditional variances and covariances even 
when assessing their behavior under CSR independence. Alternatively, one can estimate the expected values 
of Q and R empirically and substitute these estimates in the expressions. For example, for the homogeneous 
planar Poisson process, we have E[(5/?i] « .632786 and E[i?/n] « 0.621120. (estimated empirically by 1000000 
Monte Carlo simulations for various values of n on unit square). □ 



5.3 The Two-Class Case 



In the two-class case, iDixonI (|l994l ) calculates Zu = {Nu — F,[Nii]) j yjN^x\Ni^ for both i <E {1,2} and then 
combines these test statistics into a statistic that is equivalent to Cd in Equation ()10p and asymptotically 
distributed as xi- The suggested test statistic is 



Cd = Y'E'^Y 



N-n - E[7Vnl 

N22 - E[N22] 



Var[iVn] 

Cov[Afii,iV22] 



Cov[Afii,Af22] 

Var[iV22] 



iVii - E[iVii] 

N22 - E[N22] 



(15) 



Notice that this is also equivalent to C = ^aa+^Ib-^^ ZaaZbb ^j^^^g ^ ^ Nii-einu] ^ ^ iV22-E[iV22] 

' yVar[JVii] yVar[]V22] 

and r = Cov [ A^i 1 , Ar22 ] / V Var [iVi 1 ] Var [iV22 ] . Notice that Zaa = and Zbb = ^m. Furthermore, Cd has a xi 
distribution and Cn has a Xi distribution asymptotically. 

In the two-class case, segregation of class i from class j implies lack of association between classes i and j {i 7^ j) 
and lack of segregation of class i from class j implies association between classes i and j {i ^ j), since — —Zi2 
for i = 1,2. Likewise for the new cell-specific tests, since ZfJ — —Z^j for j = 1,2. In the multi-class case, a positive 
jz-score, Z^ , for the diagonal cell indicates segregation, but it does not necessarily mean lack of association 

between class i and class j {i 7^ j), since it could be the case that class i could be associated with one class, yet not 
associated with another one. Likewise for the new cell-specific tests. 

Remark 5.3. Asymptotic Structure for the NNCT-Tests: There are two major types of asymptotic structures for 
spatial data jLahiril (|l996l )V In the first, any two observations are required to be at least a fixed distance apart, hence 
as the number of observations increase, the region on which the process is observed eventually becomes unbounded. 
This type of sampling structure is called "increasing domain asymptotics" . In the second type, the region of interest is 



a fixed bounded region and more and more points are observed in this region. Hence the minimum distance between 
da ta point s tend s to zero as the sample size tends to infinity. This type of structure is called "infill asymptotics" due 
to lCressi3 (| 19931 ). The sampling structure in our asymptotic sampling distribution could be either one of increasing 
domain or infill asymptotics, as we only consider the class sizes and hence the total sample size tending to infinity 
regardless of the size of the study region. □ 



6 Empirical Significance Levels in the Two- Class Case 

In this section, we provide the empirical significance levels for Dixon's and the new overall and the cell-specific tests 
in the two-class case under RL and CSR independence patterns. 

6.1 Empirical Significance Levels under CSR Independence of Two Classes 

First, we consider the two-class case with classes X and Y . We generate ni points from class X and n2 points from 
class Y both of which are independently uniformly distributed on the unit square, (0, 1) x (0, 1). Hence, all X points 
are independent of each other and so are Y points; and X and Y are independent data sets. Thus, we simulate the CSR 
independence pattern for the performance of the tests under the null case. Notice that this will imply randomness in 
the NN structure, which is the null hypothesis for our NNCT-tests. We generate X and Y points for some combinations 
of ni, 712 £ {10, 30, 50, 100} and repeat the sample generation Nmc = 10000 times for each sample size combination in 
order to obtain sufficient precision of the results in reasonable time. At each Monte Carlo replication, we construct 
the NNCT, then compute the overall and cell-specific tests. Out of these 10000 samples the number of significant 
outcomes by each test is recorded. The nominal significance level used in all these tests is a = .05. The empirical 
sizes are calculated as the ratio of number of significant results to the number of Monte Carlo replications, Nmc- For 
example empirical size for Dixon's overall test, denoted by Su, is calculated as q_d ^ X2(-05)) where 

i is the value of Dixon's overall test statistic for iteration i, X2(-05) is the 95"* percentile of xl distribution, and 
I(-) is the indicator function. The empirical sizes for other tests are calculated similarly. 

We present the empirical significance levels for the NNCT-tests in Tabled where S|^j and Sf^j are the empirical 
significance levels of Dixon's and the new cell-specific tests, respectively, Sd is for Dixon's and Sjv is for the new 
overall tests of segregation. The empirical sizes significantly smaller (larger) than .05 are marked with (*), which 
indicate that the corresponding test is conservative (liberal). The asymptotic normal approximation to proportions 
are used in determining the significance of the deviations of the empirical sizes from the nominal level of .05. For these 
proportion tests, we also use a — .05 to test against empirical size being equal to .05. With Nmc = 10000, empirical 
sizes less than .0464 are deemed conservative, greater than .0536 are deemed liberal at a = .05 level. Notice that in 
the two-class case Si^i = ai^2 and S^^i = S2'2, since A'^12 = ni — A^n and A'^21 = n2 — N22- Notice also that Sf^i = 02^1 
and a^2 = ci2,2j since Tn = —T21 and T12 — ~T22- So only Sf^i, 02,2, Si^i, S^2) cio, and Sjv are presented in Table 
[2I The empirical sizes are also plotted in Figure [1] where the horizontal lines are the nominal level of .05 and upper 
and lower limits for the empirical size (i.e., .0464 and .0536). 

Observe that Dixon's cell-specific test for cell (1, 1) (i.e., the diagonal entry with base and NN classes are from 
the smaller class) is about the desired level for equal and large samples (i.e., ni = 712 > 30), is conservative when 
at least one sample is small (i.e., rii < 10), liberal when sample sizes are large but different (i.e., 30 < ni < 712). 
It is most conservative for (ni,n2) = (10,50). On the other hand, Dixon's cell-specific test for cell (2,2) (i.e., the 
diagonal entry with base and NN classes are from the larger class) is about the desired level for almost all sample 
size combina tions. For Di xon's cell-specific tests, if at least one sample size is small, the normal approximation is not 
appropriate. iDixonI (|l994l ) recommends Monte Carlo randomization instead of the asymptotic approximation for the 
corresponding cell-specific tests when cell counts are less than or equal to 10; and when some cell counts are less than 
5, he recommends Monte Carlo randomization for the overall test. When sample sizes are small, rii < 10 or large but 
different (30 < 7ii < 722) it is more likely to have cell count for cell (1, 1) to be < 10, however for cell (2, 2) cell counts 
are usually much larger than 10, hence normal approximation is more appropriate for cell (2, 2). 

The new cell-specific tests yield very similar empirical sizes for both cells (1, 1) and (2, 2) and are both conservative 
when ni < 30 and about the desired level otherwise. However, new cell-specific test for cell (1, 1) is less conservative 
than that of Dixon's, since Tn is less likely to be small because it also depends on the column sum. 

Dixon's overall test is about the desired level for equal and large samples (i.e., ni — n2 > 30), is conservative when 
at least one sample is small (i.e., Ui < 10), liberal when sample sizes are large but different (i.e., 30 < 7ii < 712). It is 
most conservative for (711,712) — (10,50). The new overall segregation test is conservative for small samples and has 
the desired level for moderate to large samples. 

Moreover, we not only vary samples size but also the relative abundance of the classes in our simulation study. The 



Empirical Size Plots for the NNCT-Tests under CSR Independence of Two Classes 
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Figure 1: The empirical size estimates of the cell-specific tests for cells (1,1) (left), cell (2,2) (middle), and 
overall test of segregation (right) under the CSR independence pattern in the two-class case. The empirical 
sizes for Dixon's and the new NNCT-tests are plotted in circles (o) and triangles (A), respectively. The 
horizontal lines are located at .0464 (upper threshold for conservativeness) , .0500 (nominal level), and .0536 
(lower threshold for liberalness). The horizontal axis labels: 1=(10,10), 2==(10,30), 3=(10,50), 4=(30,30), 
5=(30,50), 6=(50,50), 7=(50,100), 8=(100,100). 



differences in the relative abundance of classes seem to affect Dixon's tests more than the new tests. See for example 
cell-specific tests for cell (1, 1) for sample sizes (30, 50) and (50, 100), where Dixon's test suggests that class X (i.e., 
class with the smaller size) is more segregated which is only an artifact of the difference in the relative abundance. 
Likewise, Dixon's overall test seems to be affected more by the differences in the relative abundance. On the other 
hand, the new tests are more robust to differences in the relative abundance, since they depend on both row and 
column sums. 

Thus we conclude that Type I error rates of the new overall and cell-specific tests are more robust to the differences 
in sample sizes. Furthermore, the new tests for cells (1,1) and (2,2) and the new overall test exhibit very similar 
behavior under CSR independence. Dixon's cell-specific test for cell (2, 2) is closest to the desired level. 
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Table 2: The empirical significance levels for Dixon's and new cell-specific and overall tests in the two-class 
case under Ho : CSR independence with Nmc = 10000, ni,7i2 in {10,30,50,100} at the nominal level of 
a = .05. =: empirical size significantly less than .05; i.e., the test is conservative. ^: empirical size significantly 
larger than .05; i.e., the test is liberal, af, and af^ are for the empirical significance levels of Dixon's and 
the new cell-specific tests, respectively, for i = 1,2; an is for Dixon's and S^v is for the new overall tests of 
segregation. 



6.2 Empirical Significance Levels under RL of Two Classes 

Recall that the segregation tests we consider are conditional under the CSR independence pattern. To evaluate their 
empirical size performance better, we also perform Monte Carlo simulations under the RL pattern, for which the tests 
are not conditional. For the RL pattern we consider three cases, in each of which, we first determine the locations of 
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Figure 2: The fixed locations for which RL procedure is applied for RL Cases (l)-(3) with ni = 712 = 100 in 
the two-class case. Notice that x-axis for RL Case (3) is differently scaled. 

points and then assign labels to them randomly. 

RL Case (1): First, we generate n — (ni + 722) points iid U{{0,1) x (0,1)) for some combinations of ni,n2 G 
{10,30,50,100}. The locations of these points are taken to be the fixed locations for which we assign the labels 
randomly. Thus, we simulate the RL pattern for the performance of the tests under the null case. For each sample size 
combination (711,712), we randomly choose rij points (without replacement) and label them as X and the remaining 
772 points as Y points. We repeat the RL procedure Nmc = 10000 times for each sample size combination. At each 
RL iteration, we construct the 2x2 NNCT, and then compute the overall and cell-specific tests. Out of these 10000 
samples the number of significant results by each test is recorded. The nominal significance level used in all these tests 
is a = .05. Based on these significant results, empirical sizes are calculated as the ratio of number of significant test 
statistics to the number of Monte Carlo replications, Nmc- 

RL Case (2): We generate tii points iid W((0, 2/3) x (0,2/3)) and 712 points iid W((l/3, 1) x (1/3,1)) for some 
combinations of Tii, 712 G {10, 30, 50, 100}. The locations of these points are taken to be the fixed locations for which 
we assign labels randomly. The RL process is applied to these fixed points Nmc = 10000 times for each sample size 
combination. The empirical sizes for the tests are calculated similarly as in RL Case (1). 

RL Case (3): We generate tii points iid W((0, 1) x (0, 1)) and 712 points iid W((2, 3) x (0, 1)) for some combinations of 
Til, 712 G {10, 30, 50, 100}. RL procedure and the empirical sizes for the tests are calculated similarly as in the previous 
RL Cases. 

The locations for which the RL procedure is applied in RL Cases (l)-(3) are plotted in Figure [2] for ni = n2 — 100. 
Observe that in RL Case (1), the set of points are iid U{{0, 1) x (0, 1)), i.e., it can be assumed to be from a Poison 
process in the unit square. The set of locations are from two overlapping clusters in RL Case (2), and from two disjoint 
clusters in RL Case (3). 

We present the empirical significance levels for the NNCT-tests in Table O where the empirical significance level 
labeling is as in Table [S] The empirical sizes are marked with and ^ for conservativeness and liberalness as in Section 

Observe that Dixon's cell-specific test for cell (1, 1) has the same trend under RL Cases (l)-(3): extremely conser- 
vative when the observed cell count is very likely to be < 5 (i.e., when ni < 10 and ni ^ Ui) liberal for most other 
cases, and close to being at the nominal size for large and equal sample sizes. New cell-specific test for cell (1, 1) is 
conservative for small samples and closer to the nominal level otherwise. Moreover the new test fluctuates with smaller 
deviations from the nominal level of .05 compared to Dixon's test. 

Dixon's cell-specific test for cell (2,2) is closer to the nominal level compare to that for cell (1, 1), but is still 
conservative or liberal for very different sample sizes. The new cell-specific test for cell (2, 2) is conservative when at 
least one sample is small (i.e., Ui < 30), about the desired level otherwise. Notice also that the new cell-specific tests 
for both cells (1, 1) and (2,2) have similar empirical size performance. 

Dixon's overall test is conservative for small samples and (tii, 712) = (30, 50), and about the desired level otherwise. 
The new overall test is conservative when at least one sample size is < 10, about the desired level otherwise for RL 
Cases (2) and (3). For RL Case (1), it is conservative for small samples and liberal for very different large samples, 
about the desired level for similar size large samples. 

Thus, under RL, for large samples the new cell-specific tests have better empirical size performance compared to 
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Table 3: The empirical significance levels in the two-class case under Ho : RL for RL Cases (l)-(3) with 
Nmc = 10000, ni,n2 in {10,30,50, 100} at the nominal level of a = .05. (=: empirical size significantly less 
than .05; i.e., the test is conservative. ^: empirical size significantly larger than .05; i.e., the test is liberal, 
cell = cell-specific tests, C = overall segregation test.) 

Dixon's cell-specific tests. On the other hand the performance of the new overall test depends on the RL Case, i.e., 
the allocation of the points confounds the results of the overall tests. 

Comparing Tables [5] and O we observe that the empirical sizes are not very similar under the RL and CSR 
independence patterns. Moreover, the performance of Dixon's cell-specific test for cell (2, 2) and the new overall test 
have different size performance under each RL Case. Although cell-specific test for cell (1, 1) is very similar for all RL 
and CSR independence Cases, the other tests are not very similar, and their sizes are closer to the nominal level under 
the CSR independence pattern compared to those under RL Cases. However, we can also conclude that the tests are 
usually conservative when at least one sample is small, regardless of whether the null case is RL or CSR independence. 



7 Empirical Significance Levels in the Three- Class Case 

In this section, we provide the empirical significance levels for Dixon's and the new overall and cell-specific tests of 
segregation in the three-class case under RL and CSR independence patterns. 

7.1 Empirical Significance Levels under CSR Independence of Three Classes 

The symmetry in cell counts for rows in Dixon's cell-specific tests and columns in the new cell-specific tests occur only 
in the two-class case. Therefore, in order to better evaluate the performance of cell-specific tests in the absence of 
such symmetry, we also consider the three-class case with classes X, Y, and Z under CSR independence. We generate 
ni, 712, ns points distributed independently uniformly on the unit square (0,1) x (0,1) from classes X, Y, and Z, 
respectively. That is, each data set of classes X, Y, and Z enjoy within sample and between sample independence. 
We generate data points for some combinations of ni, n2, ns G {10, 30, 50, 100}; and for each sample size combination, 
we generate data sets X, Y, and Z for Nmc ~ 10000 times. The empirical sizes and the significance of their deviation 
from .05 are calculated as in Section [6. II 

We present the empirical significance levels for the cell-specific tests in Table |4l where the estimated levels for 
Dixon's test are provided in the top, while for the new version in the bottom for cell {i,j) £ {(1, 1), (1, 2), (1, 3), . . . , (3, 3)} 
Notice that when at least one class is small (i.e., n; < 10) tests are usually conservative, with the Dixon's cell-specific 
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Table 4: The empirical significance levels for the Dixon's cell-specific and overall tests (top) and for the new 
version of the cell-specific and overall tests (bottom) in the three-class case under Ho : CSR independence 
with Nmc = 10000, ni,n2,n3 in {10,30,50,100} at the nominal level a = .05. The empirical level is 
significantly smaller than .05; ^: The empirical level is significantly larger than .05. 

tests being the most conservative. The empirical sizes for the new cell-specific tests are closer to the nominal level 
for all sample size combinations, while Dixon's cell-specific tests fiuctuate around .05 with larger deviations. In the 
three-class case, both of the overall tests exhibit similar performance in terms of empirical size, with Dixon's test being 
slightly more conservative for small samples. Thus, Type I error rates of the new cell-specific tests are more robust to 
the differences in sample sizes (i.e., relative abundance) and are closer to .05 compared to Dixon's cell-specific tests. 

7.2 Empirical Significance Levels under RL of Three Classes 

To remove the confounding effect of conditional nature of the tests under CSR independence, we also perform Monte 
Carlo simulations under the RL pattern. For RL with 3 classes, we consider two cases. In each case, we first determine 
the locations of points and then assign labels to them randomly. 

For the RL pattern we consider three cases, in each of which, we first determine the locations of points and then 
assign labels to them randomly. 

RL Case (1): First, we generate ni + n2 + ns points iid W((0, 1) x (0, 1)) for some combinations of ni,n2,n3 G 
{10,30,50,100}. The locations of these points are taken to be the fixed locations for which we assign the labels 
randomly. Thus, we simulate the RL pattern for the performance of the tests under the null case. For each sample 
size combination (ni,n2,n3) we pick ni points (without replacement) and label them as X, pick n2 points from the 
remaining points (without replacement) and label them as Y points, and label the remaining 723 points as Z points. 
We repeat the RL procedure Nmc ~ 10000 times for each sample size combination. At each RL iteration, we construct 
the 3x3 NNCT for classes X, Y, and Z and then compute the test statistics. Out of these 10000 samples the number 
of significant tests by the tests is recorded. The nominal significance level of .05 is used in all these tests. Based on 
the number of significant results, empirical sizes are calculated as before. 



RLCase(1) RLCase(2) 
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Figure 3: The fixed locations for which RL procedure is apphcd for RL Cases (1) and (2) with ni — n2 = 
713 = 100 in the two-class case. Notice that x-axis for RL Case (2) is differently scaled. 

RL Case (2): We generate ni points lid W((0, 1) x (0, 1)), n2 points iid W((2, 3) x (0, 1)), and ns points iid 2) x 

(2,3)) for some combinations of ni, 712,^3 £ {10,30,50,100}. RL procedure is performed and the empirical sizes for 
the tests are calculated similarly as in RL Case (1). 

The locations for which the RL procedure is applied in RL Cases (1) and (2) are plotted in Figure[3]for ni = n2 = 
713 = 100. In RL Case (1), the locations of the points can be assumed to be from a Poisson process in the unit square. 
In RL Case (2), the locations of the points are from three disjoint clusters. 

We present the empirical significance levels for the NNCT-tests in Table [5] where the empirical significance level 
labeling is as in Tabled The empirical sizes are marked with and * for conservativeness and liberalness as in Section 
16.11 Observe that under both RL Cases, the new cell-specific tests are closer to the nominal level, and are more robust 
to differences in sample sizes. The overall tests exhibit similar performance under each RL Case, with sizes for Dixon's 
overall test being slightly smaller than those for the new overall test for most sample sizes. 

Comparing Tables |4] and (5] we observe that, although the empirical sizes are not similar for the RL and CSR 
independence patterns, the trend is similar. That is, the new cell-specific tests are at about the desired level for most 
sample sizes, and more robust to the differences in sample sizes compared to Dixon's cell-specific tests. Overall tests 
have similar size performance under both RL Cases. 

Remark 7.1. Main Result of Monte Carlo Simulations for Empirical Sizes: iDixonI (| 19941 ) recommends Monte 
Carlo randomization test when some cell count (s) are smaller than 10 in a NNCT for his cell-specific tests and when 
some cell counts are less than 5 for his overall tests and we concur with his suggestion. We extend his suggestion to 
the new cell-specific test for cell {i,j) when sum of column j is < 10 which happens less frequently than cell 
being < 10. 

Dixon's and new overall tests exhibit similar performance in terms of empirical sizes: for small samples they are 
usually conservative and are about the nominal level otherwise. 

Thus, when sample sizes are small (hence the corresponding cell counts are < 5 for overall test and < 10 for 
cell-specific tests), the asymptotic approximation of the tests may not appropriate, especially for Dixon's cell-specific 
tests. In this case, the power comparisons should be carried out using the Monte Carlo critical values. On the other 
hand, for large samples, the power comparisons can be made using the asymptotic or Monte Carlo critical values. 

Furthermore, Dixon's cell-specific and overall tests are confounded by the differences in the relative abundance of 
the classes. On the other hand, the new cell-specific tests are more robust to differences in sample sizes (i.e., relative 
abundance) and less sensitive to the cell counts they pertain to. □ 

Remark 7.2. Monte Carlo Critical Values: When sample sizes are small so that some cell counts or column sums 
are expected to be < 5 with a high probability, then it will not be appropriate to use the asymptotic approximation 
hence the asymptotic critical values for the overall and cell-specific tests of segregation (see Remark 17. ip . In order 
to better evaluate the empirical power performance of the tests, for each sample size combination, we record the test 
statistics at each Monte Carlo simulation under the CSR independence cases of Sections 16. 1 1 and 17. 1 1 We find the 95*'' 
percentiles of the recorded test statistics at each sample size combination (not presented) and use them as "Monte 
Carlo critical values" for the power estimation in the following sections. For example, for Dixon's cell-specific test for 



cell (1,1) in the two-class case for (ni,n2) = (30,50), the Zf^i values are recorded for (ni,n2) = (30,50) under the 
CSR independence pattern as in Section 16.11 then the 95"" percentile of these statistics is used as the Monte Carlo 
critical value for (711,712) — (30,50). That is, under a segregation or association alternative with (7x1,712) = (30,50), a 
calculated test statistic is deemed significant if it is larger than this Monte Carlo critical value. □ 

8 Finite Sample Performance of NNCT-Tests under Various Pois- 
son and Inhomogeneous Point Processes 

In this section, we provide the finite sample performance of the NNCT-tests under point patterns that are different 
from RL or CSR independence. In part i cular , we will consider various versions of Poisson cluster processes and some 
other inhomogeneous processes (|Digg:lel (|2003l )l. 

First Version of Poisson Cluster Process (PCPl{np, n\,n2, (j)): In this process, first we generate Up parents 
iid on the unit square, (0, 1) x (0, 1) then for each parent ni/np offsprings are generated for sample X and n^lup for 
sample Y from radially symmetric Gaussian distribution with parameter a. Hence we generate n\ X and 712 Y points, 
respectively. In the first case, we use the same parent set for both X and Y points. In the second case, we use different 
parent sets for each of X and Y points. 

Second Version of Poisson Cluster Process (PCP2(7ip, ni, 712, cr)): In this process, we generate Up parents 
and 711 X and 712 Y offsprings as in the first version PCPl, except the offsprings are randomly allocated amongst the 
parents. 

For both versions of the above Poisson cluster processes, we take a G {0.05, .10, .20} and (711,712) £ {(30, 30), (30, 50), 
(50,50)}. 

Matern Cluster Process (MCP{it,r, jj,)): In this process, first we generate a Poisson point process of "parent" 
points with intensity k. Then each parent point is replaced by a random cluster of points. The number of points in 
each cluster are random with a Poisson(^) distribution, and the points are placed independently and uniformly inside 
a disc of radius r centered on the parent point. The parent points are not restricted to lie in the unit square; the 
parent process is effectively the uniform Poisson process on the infinite plane. We consider n — 5, r £ {.05, .10, .20} 
for both X and Y points and /i = 7ii/5 for X points and ^ = 712/5 for Y points. In case 1, we use the same parents 
for both X and Y offsprings, while in case 2, we generate different sets of parents with k — 5. For e ach of the above 
case s, we take (111,7 1 2) € {(50, 50), (50, 100), (100, 100)}. For more on Matern cluster processes, see (|Mat'ernl (|l986l ) 
and I WaagepetersenI (|2007l ')). 

Inhomogeneous Poisson Cluster Process {IPCP{X{x,y))): In this process, the intensity of the Poisson 
process is set to be X{x,y) which is a function of {x,y). We generate a realization of the inhomogeneous Poisson 
process with intensity function \{x,y) at spatial location {x,y) inside the unit square by random "thinning". That is, 
we first generate a uniform Poisson process of intensity A(a;, y), then randomly delete or retain each point, independently 
of other points, with retention probability p{x,y) = \{x,y)/lmax where imax = sup^^ ^^jg^Q ^^j^^q jj H^,y)- 

We take X{x,y) = ni^x + y for sample X. Then for sample Y, we take X{x,y) = n^^Jx + y in case 1, \{x,y) — 
n2y/x y in case 2, and X{x,y) — n2\x — y\ in case 3. That is, in case 1 X and Y points are from the same inhomo- 
geneous Poisson process; in cases 2 and 3, they are from different processes. For each of the above cases, we take 
(111, 712) € {(50, 50), (50, 100), (1 00, 100)}. For more on inhomogeneous Poisson cluster processes, see (iDiggld (|2003l ) 
and lBaddelev and Turneil (|2005l ')'). The rejection rates of the NNCT-tests are provided in Table |6l Observe that under 
PCPl with same parents, the rejection rates are slightly (but significantly) larger than 0.05. Hence under PCPl, the 
two classes are slightly segregated, so they do not satisfy randomness in the NN structure. Under PCPl with different 
parents, the two classes are strongly segregated. Under PCP2 with the same parents, the two classes satisfy random- 
ness in the NN structure, while under PCP2 with different parents, the two classes are strongly segregated. Notice 
that under these implementations of PCP, the rejection rates decrease as a increases; i.e., the level of segregation 
is inversely related to a. Under MCP with the same parents, the two classes satisfy randomness in NN structure; 
but with different parents, the classes are strongly segregated. Furthermore, as r increases, the level of segregation 
decreases under MCP with different parents. Under IPCP patterns, the two classes satisfy randomness in NN structure 
as long as the density functions are same or similar (see cases 1 and 2); but if the density functions are very different, 
we observe moderate segregation between the two classes. Notice also that this segregation is detected better by the 
new NNCT-tests. 
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Figure 4: The empirical power estimates for Dixon's tests (circles (o)) based on asymptotic critical values 
(black) and Monte Carlo critical values (red) and new tests (triangles (A)) based on asymptotic critical values 
(black) and Monte Carlo critical values (red) under the segregation alternative Hg in the two-class case. 
The horizontal axis labels: 1=(10,10), 2=(10,30), 3=(10,50), 4=(30,30), 5=(30,50), 6=(50,50), 7=(50,100), 
8=(100,100). 



9 Empirical Power Analysis in the Two- Class Case 

We consider three cases for each of segregation and association alternatives in the two-class case. 



9.1 Empirical Power Analysis under Segregation of Two Classes 

For the segregation alternatives, we generate Xi '~ W((0, 1 — s) x (0, 1 — s)) and Yj '~ W((s, 1) x (s, 1)) for i = 1, . . . , ni 
and j = 1, . . . ,n2. Notice the level of segregation is determined by the magnitude of s G (0, 1). We consider the 
following three segregation alternatives: 

H's:s = 1/G, : s = 1/4, and : s = 1/3. (16) 

Observe that, from Hg to Hj/' (i.e., as s increases), the segregation gets stronger in the sense that X and Y 
points tend to form one-class clumps or clusters more and more frequently. We calculate the power estimates using the 
asymptotic critical values based on the standard normal distribution for the cell-specific tests and the corresponding 
X^-distributions for the overall tests and using the Monte Carlo critical values. 

The power estimates based on the asymptotic critical values are presented in Table[7] We omit the power estimates 
of the ceU-specific tests for cells (1, 2) and (2, 1), since /?f^i = /^fa and = ^§^2; likewise /3f^i = and /J^fa = ^2.2- 
Observe that, for both cell-specific tests, as n = (ni + 722) gets larger, the power estimates get larger; for the same 
n = (ni +712) values, the power estimate is larger for classes with similar sample sizes; and as the segregation gets 
stronger, the power estimates get larger at each sample size combination. For both cells (1, 1) and (2,2), the new 
cell-specific tests have higher power estimates compared to those of Dixon's. Furthermore, the new overall test has 
higher power estimates compared to Dixon's overall test. 

The power estimates based on the asymptotic and Monte Carlo critical values under Hg are plotted in Figure 
[l] Observe that the power estimates based on Monte Carlo critical values are very similar to but tend to be slightly 
larger compared to the ones using the asymptotic critical values. However, this difi'erence do not influence the trend in 
the power estimates, that is new tests tend to have higher power based on either asymptotic critical values or Monte 
Carlo critical values. Hence we omit the power estimates based on Monte Carlo critical values under other segregation 
alternatives. 

Considering the empirical significance levels and power estimates, for small samples we recommend Monte Carlo 
randomization tests; for large samples, the new version of the cell-specific tests in the two-class case when testing 
against the segregation alternatives, as they are at the desired level for more sample size combinations and have higher 
power for each cell. Likewise, we recommend the new overall test over the use of Dixon's overall test for the segregation 
alternatives. 



9.2 Empirical Power Analysis under Association of Two Classes 



For the association alternatives, we consider three cases also. In each case, first we generate Xi ~ W((0, 1) x (0, 1)) for 
i = 1, 2, . . . , ni. Then we generate Yj for j = 1, 2, . . . , 712 as follows. For each j, we pick an i randomly, then generate 

Yj as Xi + Rj (cos Tj, sin Tj)' where Rj *~ U{0,r) with r € (0,1) and Tj W(0,27r). In the pattern generated, 
appropriate choices of r will imply association between classes X and Y . That is, it will be more likely to have (X, Y) 
or {Y,X) NN pairs than same-class NN pairs (i.e., {X,X) or (Y,Y)). The three values of r we consider constitute the 
following association alternatives; 

Hi:r = 1/4, H'J : r = 1/7, and H'a" : r = 1/10. (17) 

Observe that, from ff^ to H^/' (i.e., as r decreases), the association gets stronger in the sense that X and Y points 
tend to occur together more and more frequently. By construction, for similar sample sizes the association between 
X and Y are at about the same degree as association between Y and X. For very different samples, larger sample is 
associated with the smaller but the abundance of the larger sample confounds its association with the smaller. 

The empirical power estimates are presented in Table [S] Observe that the power estimates increase as the associa- 
tion gets stronger at each sample size combination and the power estimates increase as the equal sample sizes increase 
and as the very different sample sizes increase under each association alternative. 

Dixon's cell-specific test for cell (1, 1) has extremely poor performance for very different small samples (i.e., ni < 10 
and ni =^ 712). On the other hand, for larger samples, the empirical power estimates get larger as association gets 
stronger at each sample size combination. When samples are large, class Y is more associated with class X if n2 > ni 
and this is refiected in the empirical power estimates. The power estimates for the new cell-specific test for cell (1, 1) 
increase as the association gets stronger and equal sample sizes increase. Both tests have the lowest power estimates 
for (711,712) = (10,50), since cell counts and column sums could be very small for this sample size combination. 

Dixon's cell-specific test for cell (2, 2) has higher power estimates under weak association compared to those of the 
new cell-specific test. When association gets stronger, power estimates for Dixon's cell-specific test for cell (2,2) has 
higher power for smaller samples and lower power for larger samples compared to the new cell-specific tests. The new 
cell-specific test has the worst performance for (111,712) = (10,50), in which case, column sums could be small. 

Dixon's overall test has similar power as the new overall test for smaller samples; and new overall test has higher 
power estimates for larger samples. 

Furthermore, empirical power estimates based on Monte Carlo critical values exhibit similar behavior hence not 
presented. 

Considering the empirical significance levels and power estimates, for small samples we recommend Monte Carlo 
randomization or simulation approach; for larger samples we recommend both Dixon's and new overall and cell-specific 
tests for testing against the association alternatives, as it will not be very likely to know the degree of association a 
priori. 



10 Empirical Power Analysis in the Three-Class Case 

We consider three cases for each of segregation and association alternatives in the three-class case. 

10.1 Empirical Power Analysis under Segregation of Three Classes 

For the segregation alternatives, we generate Xi '~ W((0, 1 — 2s) x (0, 1 — 2s)), Yj '~ U{(2s, 1) x (2s, 1)), and Ze '~ 
U{{s, 1 — s) x (s, 1 — a)) for i = 1, . . . , 711, j — 1, . . . , 712, and £ = !,..., 713. Notice that the level of segregation is 
determined by the magnitude of s £ (0, 1/2). We consider the following three segregation alternatives: 

Hs,:s = 1/12, Hs^: 1/8, and Hss-s = 1/6. (18) 

Observe that, from Hsi to Jfg, (i.e., as s increases), the segregation gets stronger in the sense that X, Y, and 
Z points tend to form one-class clumps or clusters more frequently. Furthermore, for each segregation alternative, X 
and Y are more segregated compared to Z and X ox Z and Y . 

We plot the empirical power estimates for the NNCT-tests in Figures[5]and[6l The test statistics are mostly positive 
for diagonal cells which implies segregation of classes and are mostly negative for off-diagonal cells which implies lack 
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Figure 5: The empirical power estimates of Dixon's cell-specific tests (circles (o)) and the new cell-specific tests 
(triangles (A)) under the segregation alternatives Hsi (black), Hs^ (red), and iJsg (blue) in the three-class 
case. The horizontal axis labels are: 1=(10,10,10), 2=(10,10,30), 3=(10,10,50), 4=(10,30,30), 5=(10,30,50), 
6=(30,30,30), 7=(10,50,50), 8=(30,30,50), 9=(30,50,50), 10=(50, 50,50), 11=(50,50,100), 12=(50,100,100), 
13=(100,100,100). Notice that they are arranged in the increasing order for the first and then the second 
entries. The size values for discrete sample size combinations are joined by piecewise straight lines for better 
visualization. 



Empirical Power Estimates of Overall Tests under Hs 
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Figure 6: The empirical power estimates of Dixon's overall test (circles (o)) and the new overall test (triangles 
(A)) under the segregation alternatives Hs^ (black), Hs^ (red), and Hs^ (blue) in the three-class case. The 
horizontal axis labels are as in Figure El 

of association between classes. For both cell-specific tests for the diagonal cells for i = 1,2,3, as equal sample 
sizes get larger, the power estimates get larger under each segregation alternative; and as the segregation gets stronger, 
the power estimates get larger for each cell at each sample size combination. The higher degree of segregation between 
X and Y is reflected in cells (1, 1) and (2,2). Furthermore, since the sample sizes satisfy n\ < n2 in our simulation 
study, cell (2, 2) power estimates tend to be larger. Since class Z is less segregated from the other two classes, cell 
(3, 3) power estimates tend to be lower than the other diagonal cell statistics. Notice also that off-diagonal cells are 
more severely affected by the differences in the sample sizes. 

The higher degree of segregation between classes X and Y can also be observed in cells (1,2) and (2, 1) power 
estimates, since more segregation of these classes imply higher negative values in these cells' test statistics. The lesser 
degree of segregation between classes X and Z can be observed in cells (1,3) and (3, 1), as they yield much lower 
power estimates compared to the other cells. Although Y and Z are segregated in the same degree as X and Z, the 
power estimates for cells (2, 3) and (3, 2) are larger than those for cells (1, 3) and (3, 1), since (ni +713) < (712 -f ^3) in 
our simulation study and larger sample sizes imply higher power under the same degree of segregation. 

Furthermore, the power estimates for the new cell-specific tests tend to be higher for each cell under each segregation 
alternative for each sample size combination. In summary, in the three-class case, new cell-specific tests have better 
performance in terms of power. 

The performance of the overall tests are similar to the performance of cell-specific tests for the diagonal cells: 
power estimates increase as the segregation gets stronger; power estimates increase as the sample sizes increase; and 
new overall test has higher power than Dixon's overall test. 

The empirical power estimates based on the Monte Carlo critical values yield similar results, hence not presented. 

Considering the empirical significance levels and power estimates, for small samples we recommend Monte Carlo 
randomization for these tests; for larger samples we recommend the new versions of the overall and cell-specific tests 
for testing against the segregation alternatives, as they either have about the same power as or have larger power than 
Dixon's tests. Furthermore, if one wants to see the level of segregation between pairs of classes, we recommend using 
the diagonal cells, (i, i) for i = 1, 2, 3 as they are more robust to the differences in class sizes (i.e., relative abundance) 
and more sensitive to the level of segregation. 

10.2 Empirical Power Analysis under Association of Three Classes 

For the association alternatives, we also consider three cases. In each case, first we generate Xi ~ W((0, 1) x (0, 1)) 
for i = 1, 2, . . . , ni . Then we generate Yj and Zi for j = 1,2, ... ,712 and = 1, 2, . . . , na as follows. For each j, we pick 
an i randomly, then generate RJ '~ U{0,ry) with ry £ (0, 1) and Tj '~ U{0,2tt) set Yj := Xi + RJ (cos Tj, sin Tj)'. 



Similarly, for each we pick an i randomly, then generate Rf ~ W(0, r^) with € (0, 1) and Ue ~ U{0, 2-jt) and set 
Ze ■- Xi + Rf (cos Ui, sin Ui)' . 

In the pattern generated, appropriate choices of ry (and Vz) values will imply association between classes X and 
Y (and X and Z). The three association alternatives are 

Ha, ■■ ry = 1/7, r, = 1/10, Ha^ ■ ry = 1/10, r, = 1/20, : r„ = 1/13, = 1/30. (19) 

Observe that, from Hai to i^As (i.e., as Vy and decrease), the association between X and Y gets stronger in the 
sense that Y points tend to be found more and more frequently around the X points. Likewise for X and Z points. 
Furthermore, by construction, classes X and Z are more associated compared to classes X and Y . 
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Figure 7: The empirical power estimates of Dixon's cell-specific tests (circles (o)) and the new cell-specific 
tests (triangles (A)) under the association alternatives Ha, (black), Ha2 (red), and i/^, (blue) in the three- 
class case. The horizontal axis labels are as in Figure [S] 

The power estimates for the NNCT-tests are plotted in Figures [7] and [8] The test statistics tend to be negative 
for the diagonal cells, which implies lack of segregation for the classes; positive for cells (1, 2), (2, 1), (1, 3), and (3, 1), 
which implies association between classes X and Y and association between classes X and Y; negative for cells (2, 3) 
and (3,2), which implies lack of association (and perhaps mild segregation) between classes Y and Z. 



Empirical Power Estimates of Overall Tests under Ha 




Figure 8: The empirical power estimates of Dixon's overall test (circles (o)) and the new overall test (triangles 
(A)) under the association alternatives Ha^ (black), Ha2 (red), and Ha^^ (blue) in the three-class case. The 
horizontal axis labels are as in Figure El 



At each sample size combination, as the association gets stronger, the power estimates increase. Further, the 
higher degree of association between X and Z compared to that of X and Y are reflected in higher power estimates 
for cell (1, 3) compared to cell (1, 2). For the same reason, power estimates for cell (3, 1) are higher than those for cell 
(2,1). 

For cells (1,2), (2,1), (1,3), and (3,1) the power estimates get larger as the equal sample sizes increase. The 
new cell-specific tests have higher power for cells (1,2) and (1,3) and Dixon's cell-specific tests have higher power for 
cells (2, 1) and (3, 1). By construction the classes Y and Z are not associated, instead they can be viewed as mildly 
segregated from each other. The test statistics for cells (2, 3) and (3, 2) are negative to indicate such segregation or 
lack of association between classes Y and Z. However, cell (3,2) power estimates are much larger than cell (3,2), 
which implies that class Z can be viewed as more segregated from class Y. As for the diagonal cells, the higher power 
estimates for cell (1, 1) for larger samples are indicative of high degree of association of classes Z and Y with class X. 
Such association, by construction, is barely refiected in cells (2,2) and (3,3). 

In summary, for cells with i < j, the new tests have higher power, while for cells with i > j, Dixon's 

tests have higher power. For the diagonal cells both versions of the cell-specific tests have about the same power 
performance. So we recommend both versions of cell-specific tests to be applied in a given situation and the results 
compared and interpreted carefully. 

The power estimates of the overall tests tend to increase as the association gets stronger at each sample size 
combination; as the sample sizes tend to increase, except for the sudden decrease at (ni,n2,n3) — (10,50,50), in 
which case cell (1, 1) counts and column 1 sums tend to be very small. Furthermore, Dixon's overall test has higher 
power compared to the new overall test. So Dixon's overall test is recommended for the association alternative over 
the new overall test. 

The empirical power estimates based on the Monte Carlo critical values yield similar results, hence not presented. 

Remark 10.1. Main Result of Monte Carlo Power Analysis: Based on the recommendations made in Remark 
17.11 when at least one sample size is small (in the sense that some cell count is < 5), we recommend Monte Carlo 
randomization for the NNCT-tests. For large samples, one can use asymptotic or Monte Carlo versions of the NNCT- 
tests. In Sections 19.11 19.21 110.11 and 110.21 we observe that under the segregation alternatives, the new cell-specific 
and new overall tests have higher power compared to Dixon's cell-specific and overall tests. Under the association 
alternatives, we observe that for cells with the associated class is the NN class, the new cell-specific tests have higher 
power, while for cells with the associated class as the base class, Dixon's cell-specific tests have higher power, and 
for diagonal cells Dixon's and new cell-specific tests have similar power. Additionally, Dixon's overall test has higher 
power than the new overall test for association. Thus we recommend both of the new and Dixon's NNCT-tests under 
the association alternatives. □ 



1 1 Examples 



We illustrate the tests on two ecological data sets: Pielou's Douglas-fir/ponderosa pine data (jPieloul (|l96ll )) and a 
swamp tree data (|Good and Whipple! (llQSd )!. 

11.1 Pielou's Data 



iPieloul (|l96l[ ) used a completely mapped data set that is comprised of two tree species: Douglas-fir trees 



{Pseudotsuga menziesii formerly P. taxifolia) and pondcrosa pine {Pinus po nderosa ) from a region in British 



Columbia. Her data set was also used by Dixon as an illustrative example (jDixorJ ||1994[ )). The question of 
interest is the type of spatial interaction between the two tree species. The corresponding 2x2 NNCT and 
the percentages for each cell are provided in Table O The cell percentages are with respect to the sample 
sizes of each species, for example, 86 % of Douglas-firs have NNs from Douglas firs and remaining 15 % of 
Douglas-firs have NNs from ponderosa pines. The row and column percentages are marginal percentages with 
respect to the total sample size. The percentage values are suggestive of segregation for both species. 

The raw data are not availab le, hence we c an not perform Monte Carlo simulation nor randomization 
versions of the tests. Fortunately, IPieloul (Il96ll ) provided Q = 162 and R = 134, hence we can calculate the 



test statistics and use the asymptotic approximation for these tests. The overall and cell-specific test statistics 
and the corresponding p- values (in parentheses) based on the asymptotic approximation, denoted by Pasy, 
are provided in Table [101 Although the locations of the tree species are not known, they can be viewed a 
priori resulting from different processes rather than some process affecting a posteriori the individuals of a 
single population. So the more appropriate null hypothesis is CSR independence of the trees. Hence our 
inference will be a conditional one (sec Remark 15. 2p . Observe that Dixon's and new overall test statistics 
yield significant p- values, implying some sort of deviation from CSR independence. In order to see the type of 
deviation, we apply the cell-specific tests. Both versions of the cell-specific tests for each cell are significant, 
implying significant deviation from CSR independence. The cell-specific test statistics are positive for the 
diagonal cells (1, 1) and (2, 2) (and negative for the off-diagonal cells (1, 2) and (2, 1)), impl ying segregatio n 



for both species. This is in agreement with what the NNCT suggests and the findings of (jDixoni (|1994[ )). 
However, Dixon's cell (1,1) statistics are much larger than cell (2,2) statistics, which may be interpreted as 
clustering of Douglas-firs is stronger than the clustering of ponderosa pines. Our simulation study indicates 
that this might be an artifact of the relative abundance of the tree species. On the other hand, new cell (1,1) 
and cell (2, 2) statistics are very similar, hence the segregation of both tree species arc at about the same 
degree. 

11.2 Swamp Tree Data 



Good and Whipple! (Il982h considered t he spat i al int eraction between tree species along the Savannah River 



South Carolina, U.S.A. From this data, Dixon! ( 2002 ') used a single 50m x 200m rectangular plot to illustrate 



his NNCT-tests. All live or dead trees with 4.5 cm or more dbh (diameter at breast height) were recorded 
together with their species. Hence it is an example of a realization of a marked multi-variate point pattern. The 
plot contains 13 different tree species, four of which comprises over 90 % of the 734 tree stems. The remaining 
tree stems were categorized as "other trees" . The plot consists of 215 water tupelos {Nyssa aquatica), 205 black 
gums (Nyssa sylvatica), 156 Carolina ashes (Fraxinus caroliniana) , 98 bald cypresses [Taxodium distichum), 
and 60 stems from 8 additional species (i.e., other species). A 5 x 5 NNCT-analysis is conducted for this data 
set. If segregation among the less frequent species is important, a more detailed 12 x 12 NNCT-analysis should 
be performed. The locations of these trees in the study region are plotted in Figure |9| and the corresponding 
5x5 NNCT together with percentages based on row and grand sums are provided in Table [TTl For example, 
for black gum as the base species and Carolina ash as the NN species, the cell count is 26 which is 13 % of 
the 205 black gums (which is 28 % of all trees). Observe that the percentages and Figure |9| are suggestive of 
segregation for all tree species, especially for Carolina ashes, water tupelos, black gums, and the "other" trees 
since the observed percentages of species with themselves as the NN are much larger than the row percentages. 

The locations of the tree species can be viewed a priori resulting from different processes, so the more 
appropriate null hypothesis is the CSR independence pattern. Hence our inference will be a conditional one 
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Figure 9: The scatter plot of the locations of water tupelos (triangles A), black gum trees (pluses +), Carolina 
ashes (crosses x), bald cypress trees (diamonds o), and other trees (inverse triangles V). 



(see Remark I5.2p . We calculate Q = 472 and R = 454 for this data set. We present Dixon's overall test of 
segregation and cell-specific test statistics and the associated p- values in Table [121 where pasy stands for the 
p- value based on the asymptotic approximation, pmc is the p- value based on 10000 Monte Carlo replication 
of the CSR independence pattern in the same plot and Prand is based on Monte Carlo randomization of the 
labels on the given locations of the trees 10000 times. Notice that Pasy, Pmc, and Prand are very similar for 
each test. We present the new overall test of segregation and cell-specific test statistics and the associated 
p- values in Table [T3l where p- values are calculated as in Table [T2l Again, all three p- values in Table [T3l are 
similar for each test. 

Dixon's and the new overall test of segregation are both significant implying significant deviation from the 
CSR independence pattern for at least for one pair of the tree species. Then to determine which pairs exhibit 
segregation or association, we perform the cell-specific tests. Dixon's and the new cell-specific tests agree for 
all cells (i.e., pairs) in term of significance at .05 level except for (B.C., B.C.), (BC,W.T), and (B.C.,C.A.) 
pairs. The statistics are all negative for the off-diagonal cells, except for (B.C.,C.A.) and (C.A., B.C.) pairs. 
Based on the Monte Carlo simulation analysis, the new test is more reliable to attach significance to these 
situations. The spatial interaction is significant between each pair which does not contain bald cypresses. 
That is, the new cell-specific test statistics are positive for the diagonal cells {i, i) for i = 1, 2, . . . , 5 and are 
significant for i = 1, 2, 3, 5 at .05 level (which also holds for Dixon's tests); and are negative for the off-diagonal 
cells (?, j) with i,j € {1,2,3,5} and i ^ j and significant for most of them. Hence each tree species except bald 
cypresses exhibit s signi ficant segregation from each other. These findings are mostly in agreement with the 
results of (|Dixonl (|2002h ). Hence except for bald cypresses, each tree species seem to result from a (perhaps) 



different first order inhomogeneous Poisson process. 

Based on the NNCT-tests above, we conclude that all tree species but bald cypresses exhibit significant 
deviation from the CSR independence pattern. Considering Figure m the corresponding NNCT in Table 
[TTl and the cell-specific test results in Tables [12] and [TH this deviation is toward the segregation of the tree 
species. However, these results pertain to small scale interaction at about the average NN distances. We 
might also be interested in the causes of the segregation and the type and level of interaction between the 
tree species at different scales (i.e., distances between the trees). To answer such questions, we also present 
the second-order analysis of the swamp tree data. We calculate Ripley's (univariate) L-function which is the 



modified version of K fimction as Lii(t) = t lKii{t)/Tr\ where t is the distance from a randomly chosen 



All Trees 



Water Tupelos 



Black Gums 





t(m) 

Caroline Ashes 




t(m) 




t(m) 
Other Trees 




t(m) 



Figure 10: Second-order analysis of swamp tree data. Functions plotted are Ripley's univariate L-functions 
Lii{t) — t for i = 0, 1, . . . , 5, where i = stands for all data combined, i = \ for water tupelos, i = 2 for 
black gums, i = 3 for Carolina ashes, z = 4 for bald cypresses, and i = 5 for other trees. Wide dashed lines 
around (which is the theoretical value) are the upper and lower (pointwise) 95 % confidence bounds for the 
L-functions based on Monte Carlo simulation under the CSR independence pattern. 



event (i.e., location of a tree). Ku^t) is an estimator of 

K{t) = A^^E[:^ of extra events within distance i of a randomly chosen event] 
with A being the density (number per unit area) of events and is calculated as 

= A-^^^ «;(/., Z,)I(d., < t)/N 



(20) 
(21) 



where A = N j A is an estimate of density [N is the observed number of points and A is the area of the study 
region), dij is the distance between points i and I(-) is the indicator function, w{li,lj) is the proportion 
of the circumference of the circle centered at k with radius dij that falls in the study area, which corrects 
for the boundary effects. Under CSR independence, L{t) — t = holds. If the univariate pattern exhibits 
aggregation, then L{t) — t tends to be positive, if it exhibits regularity then L{t) — t tends to be negative. The 
estimator K{t) is approximately unbiased for K{t) at each fixed t. Bias depends on the geometry of the study 
area and increases with t. For a re ctangular regi on it is recommended to use t values up to 1/4 of the smaller 
side length of the rectangle. See ( Digglel ( 20031 )) for more detail. So we take the values t e [0, 12.5] in our 
analysis, since the smaller side of the rectangular region of swamp tree data is 50 m. In Figure [TUl we present 
the plots of Lii{t) — t functions for each species as well as the plot of all trees combined. We also present the 
upper and lower (pointwise) 95 % confidence bounds for each Lii{t) — t. Observe that for all trees combined 
there is significant aggregation of trees (the ioo(*) ~ * curve is above the upper confidence bound) at all scales 
(i.e., distances). Water tupelos exhibit significant aggregation for the range of the plotted distances; black 
gums exhibit significant aggregation for distances t > 1 m; Carolina ashes exhibit significant aggregation for 
the range of plotted distances; bald cypresses exhibit no deviation from CSR independence for t < 5 m, then 
they exhibit significant spatial aggregation for i > 4 m; other trees exhibit significant aggregation for the 
range of plotted distances. Hence, segregation of the species might be due to different levels and types of 
aggregation of the species in the study region. 
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Figure 11: Pair correlation functions for all trees combined and for each species in the swamp tree data. Wide 
dashed lines around 1 (which is the theoretical value) are the upper and lower (pointwise) 95 % confidence 
bounds for the L-functions based on Monte Carlo simulation under the CSR independence pattern. 



We also calculate Ripley's bivariate i-function as Lijit) = y y^ijit) where Kij{t) is an estimator of 

Kij{t) = XJ^E[^ of extra type j events within distance i of a randomly chosen type i event] 
with Xj being the density of type j events. Then Kij{t) is calculated as 

K,,{t) = (a,A,a) ' < t), (22) 

i j 

where di^j, is the distance between fc*'' type i and Z*'' type j points, w{ik,ji) is the proportion of the 
circumference of the circle centered at fc*'* type i point with radius (i,^. that falls in the study area, which is 
used for edge correction. Notice that by construction, Lij{t) is symmetric in i and j, that is, Lij(t) = Lji(t) for 
all Under CSR independence, Lij(t) — t = holds. If the bivariate pattern is se gregation, then Lij{t) — t 
tends to be negative, if it is association then Lij{t) — t tends to be positive. See (Diggle ( 20031 )) for more 



detail. In Figure W2\ we present the bivariate plots of Lij{t) — t functions together with the upper and lower 
(pointwise) 95 % confidence bounds for each pair of species (due to the symmetry of Lij(t) there are only 10 
different pairs). Observe that for distances up to t w 10 m, water tupelos and black gums exhibit significant 
segregation (ii2(i) — t is below the lower confidence bound) and for the rest of the plotted distances their 
interaction is not significantly different from the CSR independence pattern; water tupelos and Carolina ashes 
are significantly segregated up to about t sa 10 m; water tupelos and bald cypresses do not have significant 
deviation from the CSR independence pattern for distances up to 4 m, for larger distances they exhibit 
significant segregation; water tupelos and the other trees do not deviate from CSR independence for the 
range of the plotted distances. Black gums and Carolina ashes are significantly segregated for t > 2 m; black 
gums and bald cypresses are significantly segregated for t > 2 m; black gum and other trees are significantly 
segregated for all the distances plotted. Carolina ashes and bald cypresses are significantly associated for 
distances larger than 3 m; and Carolina ashes and the other trees exhibit significant segregation for 3 < i < 7 



m and for f > 11 m they exhibit significant association. On the other hand, bald cypresses and other trees 
are significantly associated for distance larger than 4 m. 



But Ripley's A"-func t ion is cumulative, so interpreting the spatial interaction at larger distances is prob- 
lematic ( Wiegand et al. J 20071 ) V The (accumulative) pair correlation function g{t) is better for this purpose 



(jStovan and StovanI (|l994l )V The pair correlation function of a (univariate) stationary point process is defined 



as 

- -2^ 

where K'{t) is the derivative of K(t). For a univariate stationary Poisson process, g{t) — 1; values of g{t) < 1 
suggest inhibition (or regularity) between points; and values of g{t) > 1 suggest clustering (or aggregation). 
The pair correlation functions for all trees and each species for the swamp tree data are plotted in Figure 
fm Observe that all trees are aggregated around distance values of 0-1,3,4,5,7,9-10 m; water tupelos are 
aggregated for distance values of 0-4 and 5-7 m; black gums are aggregated for distance values of 1-6 and 8-11 
m; Carolina ashes are aggregated for all the range of the plotted distances; bald cypresses are aggregated for 
distance values of 2-8 and around 11m; and other trees are aggregated for all distance values except 3-5 m. 
Comparing Figures [10] and [TTl we see that Ripley's L and pair correlation functions detect the same patterns 
but with different distance values. That is, Ripley's L implies that the particular pattern is significant for a 
wider range of distance values compared to Q(t ), since Ripley's L is cumu lative, so the values of L at small 



scales confound the values of L at larger scales (jLoosmore and FordI (|2006( )). Hence the results based on pair 
correlation function g{t) are more reliable. 

The same definition of the pair correlation function can be applied to Ripley's bivariate K or L-functions 
as well. The benchmark value of Kij[t) = 7rt^ corresponds to g{t) = 1; g{t) < 1 suggests segregation of the 
classes; and g{t) > 1 suggests association of the classes. The bivariate pair correlation functions for the species 
in swamp tree data are plotted in Figure [131 Observe that water tupelos and black gums are segregated for 
distance values of 0-1 m; water tupelos and Carolina ashes are segregated for values of 0-1 and 2.5 m and are 
associated for values about 6 m; water tupelos and bald cypresses are segregated for 0-1, 5.5, 9.5, and 11 m 
and are associated for 6.5 m; water tupelos and other trees are segregated for 0-0.5 and 7 m and are associated 
for 8 m; black gums and Carohna ashes are segregated for 2-2.5, 3.5-4.5, 6-8.5, and 9.5-12 m; black gums and 
bald cypresses are segregated for 3.5, 5.5-6.5,7, and 9.5 m; black gums and other trees are segregated for 5 
and 6-7.5 m; Carolina ashes and bald cypresses are associated for 1.5-3, 5.5., and 7 m; Carolina ashes and 
other trees are associated for 5 and 9-10 m; and bald cypresses and other trees are segregated for 4 m and 
are associated for 3-4 and 6.5-7.5 m. 

However the pair correlation function estimates might have critical behavior for small t if g(t) > since 
the estima tor variance and hence the bias are considerably large. This problem gets worse especially in cluster 
processes ( Stovan and StovanI ( 19961 )). See for example Figures [Til and [T3l where the confidence bands for 



smaller t values are much wider compared to those for larger t values. So pair correlation function analysis is 
more reliable for larger distances and it is safer to use g{t) for distances larger than the average NN distance 
in the data set. Comparing Figure [TOj with Figure [11] and Figure [12] with Figure [13] we see that Ripley's L 
and pair correlation functions usually detect the same large-scale pattern but at different ranges of distance 
values. Ripley's L suggests that the particular pattern is significant for a wider range of distance values 
compared to g( t), since values of L at sma ll scales confound the values of L at larger scales where g{t) is more 
reliable to use I Loosmore and Ford (|2006l) ). 



While second order analysis (using Ripley's K and L-functions or pair correlation function) provides in- 
formation on the univariate and bivariate patterns at all scales (i.e., for all distances), NNCT-tests summarize 
the spatial interaction for the smaller scales (for distances about the average NN distance in the data set). In 
particular, for the swamp tree data average NN distance (± standard deviation) is about 1.8 (± 1.04) meters 
and notice that Ripley's L-function and NNCT-tests yield similar results for distances about 2 meters. 



12 Discussion and Conclusions 

In this article we introduce new overall and cell-specific tests of segregation based on nearest neighbor con- 
tingency tables (NNCTs). Such tests are referred to as NNCT-tests. We also consider Dixon's NNCT-tests, 
discuss the differences in these (new and Dixon's) NNCT-tests, present the asymptotic properties of them. 



compare the tests using extensive Monte Carlo simulations under RL and CSR independence and under var- 
ious segregation and association alternatives for two and three clas ses. We also illustrate the tests on two 
examples and compare them with Ripley's L-function I Riplevl ( 2004 )). 



NNCT-tcsts (i.e., overall and cell-specific tests of segregation) are used in testing randomness in the nearest 
neighbor (NN) structure between two or more classes. The overall test is used for testing any deviation from 
randomness in all the NNCT cells combined; cell-specific test for cell is used for testing any deviation 
from randomness in cell i.e., NN structure in which base class is i and NN class is j. This statistic 

tests the segregation or lack of it if i = j; the association or lack of it between classes i and j \i i j. The 
randomness in the NN structure is implied by the RL or CSR independence patterns. We demonstrate that 
under the CSR independence pattern, NNCT-tests are conditional on Q and R, while under the RL pattern, 
these tests are unconditional. In the two-class case, cell-specific tests are essentially different only for two 
cells, since cell (1,1) and (1,2) yield the same test statistic in absolute value for Dixon's cell-specific test, 
likewise for cells (2, 1) and (2, 2). Similarly, cell (1, 1) and (2, 1) yield the same test statistic in absolute value 
for the new cell-specific test, likewise for cells (1,2) and (2,2). 

Based on our Monte Carlo simulations, we conclude that the asymptotic approximation for the cell-specific- 
tests is appropriate only when the corresponding cell count in the NNCT is larger than 10; and for the overall 
test when all cell counts arc larger than 4. When at least one cell count is less than 5, we recommend the 
Monte Carlo randomization version of the overall tests; and when a cell count is less than 10, we recommend 
the Monte Carlo randomization of the cell-specific tests. When each cell count is larger than 5, the new 
versions of the segregation tests have empirical significance levels closer to the nominal level. Type I error 
rates (empirical significance levels) of the new cell-specific tests are more robust to the differences in sample 
sizes (i.e., differences in relative abundance). When some cell count (s) are less than 5 for overall test and 
less than 10 for the cell-specific tests, we compare the power of the tests using Monte Carlo critical values. 
For large samples, the power comparisons can be made using both the asymptotic or Monte Carlo critical 
values. For the segregation alternatives, we conclude that the new cell-specific and overall tests have higher 
power estimates compared to those of Dixon's tests. For the association alternatives, we observe that the 
best performer NNCT-test depends on the cell and level of association. When testing against association, the 
new cell-specific tests have higher power estimates for the upper triangular cells in the NNCT and Dixon's 
cell-specific tests have higher power estimates for the lower triangular cells in the NNCT. We recommend 
the new cell-specific and overall tests for the segregation alternatives. For the association alternatives, we 
recommend both versions of the overall and cell-specific tests. 

The CSR independence pattern assumes that the study region is unbounded for the analyzed pattern, 
which is not the case in practice. Edge effects are a constant problem in the analysis of e mpirical (i.e., bounded) 



data s ets and much effort has gone into the development of edge corrections methods (jYamada and Rogersen 



( 2003[ )). So the edge (or boundary) effects might confound the test results if the null pattern is the CSR 



independence. Two correction methods for the edge effects on NNCT-tests, namely buffer zone correction 
and toroidal correction, are investigated in (|Cevhanl ([20o3)) where it is shown that the empirical sizes of 



the NNCT-tests are mildly affected by the toroidal edge correction. However, the (outer) buffer zone edge 
correction method seems to have slightly stronger influence on the tests compared to toroidal correction. But 
for these tests, buffer zone correction doe s not change t he siz es significantly for most sample size combinations. 
This is in agreement with the findings of lBarot et all (|l999l ) who say NN methods only require a small buffer 



area around the study region. A large buffer area does not help much since one only needs to be able to see 
far enough away from an event to find its NN. Once the buffer area extends past the likely NN distances (i.e., 
about the average NN distances), it is not adding much helpful information for NNCTs. Hence we recommend 
inner or outer buffer zone correction for NNCT-tests with the width of the buffer area being about the average 
NN distance. We do not recommend larger buffer areas, since they are wasteful with little additional gain. 
On the other hand, we recommend the use of toroidal edge correction with points within the average NN 
distance in the additional copies around the study region. For larger distances, the gain might not be worth 
the effort. 

NNCT-tests summarize the pattern in the data set for small scales, more specifically, they provide infor- 
mation on the pattern around the average NN distance between all points. On the other hand, pair correlation 
function g{t) and Ripley's classical K or L-functions and other variants provide information on the pattern at 
various scales. However, the classical L-function is not appropriate for the null patter n of RL when locations 
of the points have spatial inhomogeneity. For such cases, Diggle's D-function (|Digglel (|2003[ ) p. 131) IS more 



appropriate in testing the bivariate spatial clustering at various scales. 



Ripley's classical K{t) or L{t) functions can be used when the null pattern can be assumed to be CSR 
independence, that is when the null pattern assumes first-order homogeneity for each cla ss. When the nu ll 
pattern is the RL of points from an inhomogeneous Poisson process they are not appropriate (Kulldorfl ( 2006f )): 



Cuzick-Edward's /c-NN tests are designed for testing bivariate spatial interaction and mostly used for spatial 
cluste ring of cases in epidemiology; Diggle's D-function is a modified version of Ripley's iiT- function ( Diggld 



( 2003f )) and adjusts for any inhomogeneity in the locations of, e.g., cases and control s. Furthermore, there are 



variants of K{t) that explicitly correct for inhomogeneity (see lBaddelev et aL I (I2OOOI) ). Ripley's K-, Diggle's 



-D-functions and pair correlation functions are designed to analyze univariate or bivariate spatial interaction 
at various scales (i.e., inter-point distances). Our example illustrates that for distances around the average 
NN distance, NNCT-tests and Ripley's bivariate L-function yield similar results. 

The NNCT-tests and Ripley's L-function provide similar information in the two-class case at small scales. 
For g-class case with q > 2 classes, overall tests provide information on the (small-scale) while the Ripley's 
L-function requires performing all bivariate spatial interaction analysis. The cell-specific tests can serve as 
pairwise post hoc analysis only when the overall test is significant. Furthermore, the cell-specific tests are 
testing the spatial clustering of one class or bivariate interaction between two classes as part of the multivariate 
interaction between all the classes. On the other hand, Ripley's univariate K- or L-functions are restricted to 
one class and bivariate K- or L-functions are restricted to two classes they pertain to, ignoring the potentially 
important multivariate interaction between all classes in the study area. However, there are forms of the J- 
function which is derived from the well-known G and F functions ( van Lieshout and Baddelev ( 1999[ )) and 



deal wi th this multi-type setting (i.e . , cons ider the pattern of type i in the context of the pattern of all other 
types). Ivan Lieshout and Baddelev ( 1999t ) define two basic types of J- functions. First is a type-i-to-type-j 



function which considers the points of type i in the context of the points of type j. The second one is the 
type-i-to-any-type function which considers the points of type i in the context of points of all types including 
type i. Other forms can be derived from them by re-defining the types. For example, if we want to consider 
the points of type i in the context of points of all other types, then we collapse all the other types j (i.e., all 
j which are not equal to i) into a single type i' and then use the type-i-to-type-i' fun ction. Several authors 



have written about the bi variate iT- fun ction, which is of the type-i-to-type-j form (jPiggle and Chetwvnd 



(ll99ll) . lHaas3 (Il995l) . and|Diggld tOO^ ). Type-i-to-type-j iiT-function can easily be modified to type-i-to- 



any-type if-function. Thus essentially there is only one family of multi-type i^-functions in literature. But 
type-i-to-type-j if-function is comparable with a NNCT analysis based on a 2 x 2 NNCT restricted to the 
classes i and j. Similarly, type-i-to-type-i' /-C- function is comparable with the NNCT analysis based on a 
2x2 NNCT with classes i and the rest of the classes labeled as i'. Since pairw ise ana l ysis o f q classes 
with 2x2 NNCTs might yield conflicting results compared to q x q NNCT analysis ( Dixon ( 20021 )). Ripley's 



L-function and NNCT-tests might also yield conflicting results at small distances. Hence Ripley's L-function 
and NNCT-tests may provide similar but not identical information about the spatial pattern and the latter 
might provide small-scale interaction that is not detected by the former. Since the pair correlation functions 
are derivatives of Ripley's if-function, most of the above discussion holds for them also, except g{t) is reliable 
only for large scale interaction analysis. Hence NNCT-tests and pair correlation function are not comparable 
but provide complimentary information about the pattern in question. 

Cell-specific tests for diagonal cells in a NNCT and Ripley's univariate K- or L-functions (and hence pair 
correlation functions) are symmetric, as they measure the spatial clustering of one class only. On the other 
hand, Ripley's bivariate K- or L-functions and pair correlation functions are symmetric in the two classes 
they pertain to. But cell-specific tests for two classes (i.e., off-diagonal cells in the NNCT) are not symmetric. 
Hence, at small scales, the cell-specific test for an off-diagonal cell, provides the type and different levels of 
spatial interaction for the corresponding two classes, while Ripley's L-function and pair correlation function 
provide only the type of spatial interaction, but can not distinguish the class-specific level of interaction for 
each of the two classes in question. 

For a data set for which CSR independence is the reasonable null pattern, we recommend the overall 
segregation test if the question of interest is the spatial interaction at small scales (i.e., about the mean NN 
distance). If it yields a significant result, then to determine which pairs of classes have significant spatial 
interaction, the cell-specific tests can be performed. One can also perform Ripley's K or L-function and 
only consider distances up to around the average NN distance and compare the results with those of NNCT 
analysis. If the spatial interaction at higher scales is of interest, pair correlation function is recommended 



( Loosmore and Ford 

On the other hand, if the RL pattern is the reasonable null pattern for the data, we recommend the NNCT- 
tests if the small-scale interaction is of interest and Diggle's _D-function if the spatial interaction at higher 
scales is also of interest. 
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Table 5: The empirical significance levels in the three-class case under Ho : RL for RL Cases (1) and (2) 
with Nmc = 10000, ni,n2,n3 in {10,30,50,100} at the nominal level of a = .05. C': the empirical size is 
significantly smaller than .05; i.e., the test is conservative. ^: the empirical size is significantly larger than 
.05; i.e., the test is liberal, cell = ccU-spccific test, C = overall test.) 



Rejection Rates of the NNCT- Tests Under Various PCP and Inhomogeneous Patterns 
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IPCP{X{x, y) = rii fi{x, y), (0, 1) x (0, 1)) with 1 = 1,2 for X and Y points, respectively 
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Table 6: The rejection rates for the NNCT-tests under various patterns different from GSR and RL. PCP: Poisson Cluster Process, MCP: Matern Cluster 
Process, IPCP: Inhomogeneous Poisson Cluster Process. See Section [8] for details on these point processes. 
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.9975 


.9994 


100,100) 


.9998 


.9999 


1.000 


1.000 


1.000 


1.000 




10,10) 


.5144 


.5121 


.7324 


.7320 


.5817 


.7296 


10,30) 


.8873 


.7833 


.9402 


.9425 


.8787 


.9409 


10,50) 


.9353 


.8002 


.9699 


.9711 


.9528 


.9713 


30,30) 


.9929 


.9915 


.9990 


.9990 


.9969 


.9990 


30,50) 


.9999 


.9979 


1.000 


1.000 


.9997 


1.000 


50,50) 


.9999 


1.000 


1.000 


1.000 


1.000 


1.000 


50,100) 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 


100,100) 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 



Table 7: The empirical power estimates for the tests under the segregation alternatives, Hg, Hg^ , and Hg^^ 
in the two-class case with Nmc ~ 10000, for some combinations of ni, n2 € {10, 30, 50} at a = .05. The power 
estimates that are not significantly different between Dixon's and the new cell specific tests are marked with 
an asterisk (*). For all others the larger power estimate is significantly larger than the other at a = .05. Pd 
and Pn stand for empirical power estimates for Dixon's and new overall tests, respectively. P{^ and P^^ stand 
for empirical power estimates for Dixon's and cell-specific tests, respectively, for cell (i, i) with i = 1, 2. 



Empirical power estimates under the association alternatives 





sizes 


Dixon's 


New 


Overall 














/3d 






(10,10) 


.1349 


.1776 


.1638 


.1689 


.1105 


.1792 


(10,30) 


.0002 


.4366 


.2575 


.2728 


.3007 


.2838 


(10,50) 


.0002 


.4947 


.0686 


.1071 


.3318 


.1536 


(30,30) 


.1413 


.2434 


.2110 


.2134 


.1697 


.2138 


(30,50) 


.1833 


.3984 


.3268 


.3314 


.2903 


.3335 


(50,50) 


.1149 


.2421 


.2151 


.2181 


.1738 


.2181 


(50,100) 


.1813 


.4411 


.3448 


.3497 


.3410 


.3534 


(100,100) 


.0853 


.2309 


.1720 


.1740 


.1677 


.1750 




(10,10) 


.2499 


.2569 


.2898 


.2900 


.1834 


.3006 


(10,30) 


.0000 


.6463 


.4919 


.5123 


.4956 


.5255 


(10,50) 


.0000 


.7062 


.1959 


.2699 


.5500 


.3418 


(30,30) 


.4053 


.4457 


.5267 


.5293 


.4141 


.5294 


(30,50) 


.4896 


.6957 


.7196 


.7239 


.6332 


.7258 


(50,50) 


.4034 


.4961 


.5824 


.5848 


.4616 


.5854 


(50,100) 


.5527 


.7991 


.8003 


.8043 


.7559 


.8070 


(100,100) 


.3868 


.5575 


.5944 


.5957 


.5013 


.5981 




(10,10) 


.3038 


.2918 


.3475 


.3471 


.2222 


.3554 


(10,30) 


.0000 


.7364 


.6115 


.6290 


.6003 


.6407 


(10,50) 


.0000 


.7907 


.2885 


.3718 


.6512 


.4522 


(30,30) 


.6092 


.6011 


.7308 


.7301 


.6157 


.7319 


(30,50) 


.7211 


.8491 


.9052 


.9072 


.8386 


.9082 


(50,50) 


.6842 


.6891 


.8289 


.8302 


.7285 


.8299 


(50,100) 


.8024 


.9442 


.9631 


.9640 


.9433 


.9648 


(100,100) 


.7207 


.7973 


.8831 


.8828 


.8030 


.8834 



and TJi^^ 



Table 8: The empirical power estimates for the tests under the association alternatives 7?^, H 
in the two-class case with TV^c = 10000, for some combinations of ni, n2 S {10, 30, 50} at a = .05. The power 
estimates that are not significantly different between Dixon's and the new cell specific tests are marked with 
an asterisk (*). For all others the larger power estimate is significantly larger than the other at a = .05. 



NN 



NN 





D.F. 


P.P. 


sum 




D.F. 


P.P. 




base 


137 
38 


23 
30 


160 

68 


base pp- 


86 % 
56 % 


15 % 

44 % 


70 % 
30 % 


sum 


175 


53 


228 


77% 


23 % 


100 % 



Table 9: The NNCT for Pielou's data (left) and the corresponding percentages (right), where the cell 
percentages are with respect to the row sums (i.e., species sizes) and the marginal percentages are with 
respect to the total sample size. D.F. = Douglas-firs, P.P. = ponderosa pines. 



Dixon's NNCT-Tests 
Overall test 

Cp = 19.67 (pa.y = .0001) 

Cell-specific tests 



The New NNCT-Tests 
Overall test 

Gn = 13.11 (pasy = .0003) 

Cell-specific tests 





D.F. 


P.P. 




D.F. 


P.P. 


D.F. 


4.36 


-4.36 




D.F. 


3.63 


-3.61 




(< .0001) 


(< .0001) 






(.0003) 


(.0003) 


P.P. 


-2.29 


2.29 




P.P. 


-3.63 


3.61 




(.0221) 


(.0221) 






(.0003) 


(.0003) 



Table 10: The overall and cell-specific test statistics for Dixon's NNCT-tests (left) and the new NNCT-tests 
(right) and the corresponding p- values (in parenthesis) based on asymptotic approximation for Pielou's data. 
D.F. = Douglas-firs, P.P. — ponderosa pines; Co and Cat stand for the value of Dixon's and new overall test 
statistic, respectively. Pasy stands for the p- value based on the asymptotic approximation of the tests. 



NN 





W.T. 


B.G. 


C.A. 


B.C. 


O.T. 


sum 


W.T. 


112 (52 %) 


40 (19 %) 


29 (13 %) 


20 (11 %) 


14 (9 %) 


215 (29 %) 


B.G. 


38 (19 %) 


117 (57 %) 


26 (13 %) 


16 (8 %) 


8 (4 %) 


205 (28 %) 


base C.A. 


23 (15 %) 


23 (15 %) 


82 (53 %) 


22 (14 %) 


6 (4 %) 


156 (21 %) 


B.C. 


19 (19 %) 


29 (30 %) 


29 (30 %) 


14 (14 %) 


7 (7 %) 


98 (13 %) 


O.T. 


7 (12 %) 


8 (13 %) 


5 (8 %) 


7 (12 %) 


33 (55 %) 


60 (8 %) 


sum 


199 (27 %) 


217 (30 %) 


171 (23 %) 


79 (11 %) 


68 (9 %) 


734 (100 %) 



Table 11: The NNCT for swamp tree data (left) and the corresponding percentages (right), where the cell 
percentages are with respect to the row sums and marginal percentages are with respect to the total size. 
W.T. = water tupelos, B.G. = black gums, C.A. = Carolina ashes, B.C. = bald cypresses, and O.T. = other 
tree species. 



Dixon's overall test 

Cp = 275.64 (pasy < .0001, p^, < .0001, p.and < -0001) 



Dixon's cell-specific tests 







W.T. 


B.G. 


C.A. 


B.C. 


O.T. 






6.39 


-3.11 


-2.87 


-1.82 


-0.94 


W.T. 


Pasy 
Pmc 
.Prand 


(< .0001) 
(< .0001) 
(< .0001) 


(.0019) 
(.0014) 
(.0015) 


(.0041) 
(.0043) 
(.0048) 


(.0682) 
(.0702) 
(.0670) 


(.3484) 
(.3489) 
(.3286) 






-3.44 


8.05 


-3.09 


-2.43 


-2.34 


B.G. 


Pasy 
Pmc 
Prand 


(.0006) 
(.0004) 
(.0005) 


(< .0001) 
(< .0001) 
(< .0001) 


(.0020) 
(.0014) 
(.0012) 


(.0150) 
(.0179) 
(.0172) 


(.0194) 
(.0192) 
(.0198) 






-4.05 


-3.73 


8.08 


0.28 


-2.04 


C.A. 


Pasy 
Pmc 
Prand 


(.0001) 
(< .0001) 
(< .0001) 


(.0002) 
(.0004) 
(.0001) 


(< .0001) 
(< .0001) 
(< .0001) 


(.7820) 
(.7810) 
(.7580) 


(.0410) 
(.0430) 
(.0409) 






-2.18 


-0.36 


2.04 


0.25 


-0.38 


B.C. 


Pasy 
Pmc 
Prand 


(.0295) 
(.0292) 
(.0293) 


(.7180) 
(.7129) 
(.6861) 


(.0418) 
(.0410) 
(.0419) 


(.8011) 
(.7601) 
(.7910) 


(.7008) 
(.6739) 
(.6555) 






-3.02 


-2.54 


-2.47 


-0.39 


10.77 


O.T. 


Pasy 
Pmc 
Prand 


(.0025) 
(.0028) 
(.0021) 


(.0112) 
(.0112) 
(.0121) 


(.0135) 
(.0115) 
(.0158) 


(.6952) 
(.6582) 
(.6490) 


(< .0001) 
(< .0001) 
(< .0001) 



Table 12: Test statistics and p- values for Dixon's overall and cell-specific tests and the corresponding p- values 
(in parentheses). W.T. = water tupelos, B.G. = black gums. C.A. = Carolina ashes. B.C. = bald cypress and 
O.T. = other tree species. Pasy, Pmc, and Prand stand for the p- values based on the asymptotic approximation, 
Monte Carlo simulation, and randomization of the tests, respectively. 



New overall test 

Cn = 263.10 (pa.y < .0001, Pmc < -0001, Prand < -0001) 



New cell-specific tests 







W.T. 


B.G. 


C.A. 


B.C. 


O.T. 






7.55 


-4.08 


-4.06 


-0.74 


-1.74 


W.T. 


Pasy 
Pmc 
J^rand 


(< .0001) 
(< .0001) 
(< .0001) 


(< .0001) 
(< .0001) 
(< .0001) 


(.0001) 
(< .0001) 
(< .0001) 


(.4584) 
(.4564) 
(.4645) 


(.0819) 
(.0860) 
(.0824) 






-3.04 


8.16 


-4.25 


-1.45 


-3.27 


B.G. 


Pasy 
Pmc 
Prand 


(.0023) 
(.0028) 
(.0018) 


(< .0001) 
(< .0001) 
(< .0001) 


(< .0001) 
(< .0001) 
(< .0001) 


(.1479) 
(.1550) 
(.1493) 


(.0011) 
(.0013) 
(.0008) 






-3.71 


-4.52 


7.96 


1.36 


-2.77 


C.A. 


Pasy 
Pmc 
prand 


(.0002) 
(.0001) 
(.0001) 


(< .0001) 

(.0001) 
(< .0001) 


(< .0001) 
(< .0001) 
(< .0001) 


(.1745) 
(.1776) 
(.1806) 


(.0056) 
(.0064) 
(.0064) 






-1.78 


0.00 


1.61 


0.89 


-0.82 


B.C. 


Pasy 
Pmc 
Prand 


(.0754) 
(.0702) 
(.0723) 


(.9977) 
(.9952) 
(.9958) 


(.1081) 
(.1098) 
(.1114) 


(.3725) 
(.3771) 
(.3796) 


(.4097) 
(.4105) 
(.4071) 






-2.72 


-2.90 


-2.94 


0.21 


10.71 


O.T. 


Pasy 
Pmc 
Prand 


(.0066) 
(.0060) 
(.0070) 


(.0037) 
(.0031) 
(.0036) 


(.0033) 
(.0027) 
(.0026) 


(.8335) 
(.8375) 
(.8354) 


(< .0001) 
(< .0001) 
(< .0001) 



Table 13: Test statistics and p- values for the new overall and cell-specific tests and the corresponding p- values 
(in parentheses). The labeling of the species and p- values are as in Table [T2] 



W.T. vs B.G. W.T. vs C.A. W.T. vs B.C. 




02468 10 12 02468 10 12 02468 10 12 



t{m) t(m) t(m) 



W.T. vs O.T. B.G. vs C.A. B.G. vs B.C. 




02468 10 12 02468 10 12 02468 10 12 




02468 10 12 02468 10 12 02468 10 12 



t{m) t(m) t(m) 



B.C. vs O.T. 




2 4 6 8 10 12 



t(m) 

Figure 12: Second-order analysis of swamp tree data. Functions plotted are Ripley's bivariate L- functions 
Lijit) — t for i, j = 1, 2, . . . , 5 and i ^ j where i stands for all data combined, i = 1 for water tupelos, 
i = 2 for black gums, i = 3 for Carolina ashes, i — A for bald cypresses, and i = 5 for other trees. Wide 
dashed lines around (which is the theoretical value) are the upper and lower (pointwise) 95 % confidence 
bounds for the L-functions based on Monte Carlo simulations under the CSR independence pattern. W.T. 
= water tupelos, B.G. = black gum, C.A. = Carolina ashes, B.C. = bald cypresses, and O.T. = other tree 
species. 



W.T. vs B.G. 



W.T. vs C.A. 



W.T. vs B.C. 




t(in) 



t(m) 



t(m) 



W.T. vs O.T. 




t(m) 
B.G. vs O.T. 




2 4 



t(m) 



B.G. vs C.A. 



■a , ^ 



^'^ 



2 4 



t(m) 
C.A. vs B.C. 



2 4 



t(m) 
B.C. vs O.T. 




t(m) 



B.G. vs B.C. 



1 1 
1 1 




2 4 6 

t(m) 



C.A. vs O.T. 




2 4 



t(m) 



Figure 13: Pair correlation functions for each pair of species in the swamp tree data. Wide dashed hnes 
around 1 (which is the theoretical value) are the upper and lower (pointwise) 95 % confidence bounds for the 
L-functions based on Monte Carlo simulations under the CSR independence pattern. W.T. = water tupelos, 
B.G. = black gums, C.A. = Carolina ashes, B.C. = bald cypresses, and O.T. = other tree species. 



